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. 1 . 2^, Man 2d03 

Use of a secreted protein for preventing and treating obesity, 
diabetes and/or metabolic syndrome ^ 

Description 

This invention relates to the use of saposin-related proteins, to the use of 
polynucleotides encoding these, and to the use of effectors thereof in the 
diagnosis, study, prevention, and treatment of obesity and / or diabetes 
mellitus and/or metabolic syndrome. 

Many human proteins serve as pharmaceuticaily active compounds. 
Several classes of human proteins that serve as such active compounds 
include hormones, cytokines, cell growth factors, and cell differentiation 
factors. Most proteins that can be used as a pharmaceuticaily active 
compound fall within the family of secreted proteins. Secreted proteins are 
generally produced within cells at rough endoplasmic reticulum, are then 
exported to the golgi complex, and then move to secretory vesicles or 
granules, where they are secreted to the exterior of the cell via exocytosls. 
Examples for commercially used secreted proteins are human insulin, 
thrombolytic agents, interferons, interleukins, erythropoietins, colony 
stimulating factors, human growth hormone, transforming growth factor 
beta, tissue plasminogen activator, erythropoeitin, and various other 
proteins. Receptors of secreted proteins, which are membrane-bound 
proteins, also have potential as therapeutic or diagnostic agents. 

It is, therefore, important for developing new pharmaceutical compounds 
to identify secreted proteins that can be tested for activity in a variety of 
animal models. Thus, in light of the pervasive role of secreted proteins in 
human physiology, a need exists for identifying and characterizing novel 
functions for human secreted proteins and the genes that encode them. 
This knowledge will allow one to detect, to treat, and to prevent medical 



diseases, disorders, and/or conditions by using secreted .proteins or the 
genes that encode thenn. 

Obesity is one of the most prevalent metabolic disorders in the world. It is 
still a poorly understood human disease that becomes as a major health 
problem more and more relevant for western society. Obesity is defined as 
a body weight more than 20% in excess of the ideal body weight, 
frequently resulting in a significant impairment of health. Obesity may be 
measured by body mass index, an indicator of adiposity or fatness. Further 
parameters for defining obesity are waist circumferences, skinfold 
thickness and bioimpedance (see, inter alia, Kopelman (1999), loc. cit.). It 
is associated with an increased risk for cardiovascular disease, 
hypertension, diabetes mellitus type II, hyperlipidaemia and an increased 
mortality rate. 



Obesity is influenced by genetic, metabolic, biochemical, psychological, 
and behavioral factors and can be caused by different reasons such as 
non-insulin dependent diabetes, increase in triglycerides, increase in 
carbohydrate bound energy and low energy expenditure. As such, it is a 
complex disorder that must be addressed on several fronts to achieve 
lasting positive clinical outcome. Since obesity is not to be considered as 
a single disorder but as a heterogeneous group of conditions with 
(potential) multiple causes. Ft is also characterized by elevated fasting 
plasma insulin and an Exaggerated insulin response to oral glucose intake 
(Koltermann J., (1980) Clin. Invest 65, 1272-1284). A clear involvement 
of obesity In type 2 diabetes mellitus can be confirmed (Kopelman P.G., 
(2000) Nature 404, 635-643). 

Triglycerides and glycogen are used as the body's fuel energy storage. 
Glycogen is a large branched polymer of glucose residues that is mainly 
stored in liver and muscle cells. Glycogen synthesis and degradation is 
central to the control of the blood glucose level. Triglycerides are stored In 



the cytoplasm of adipocytes. Adipocytes are specialized for the synthesis, 
storage and mobilization of triglycerides. The glycogen and triglyceride 
metabolism is highly regulated and their interplay is essential for the energy 
homeostasis of the body. A high glucose level in the adipocytes results in 
the synthesis of triglycerides as fuel store. A low intracellular glucose level 
leads to a release of fatty acids, which can be used as substrates for the 
beta-oxidation to generate energy. Glycogen levels In cells are more 
variable than triglyceride levels because the turnover of glycogen is higher. 
Triglycerides are used as long term energy donors once the glycogen 
stores run low. 

Insulin amongst other hormones plays a key role in the regulation of the 
fuel metabolism. High blood glucose levels stimulate the secretion of 
insulin by pancreatic beta-cells. Insulin leads to the storage of glycogen 
and triglycerides and to the synthesis of proteins. The entry of glucose into 
muscles and adipose cells is stimulated by insulin. 

In patients who suffer from diabetes mellitus either the amount of insulin 
produced by the pancreatic islet cells is to low (diabetes type 1 or insulin 
dependent diabetes mellitus IDDM) or liver and muscle cells loose their 
ability to respond to normal blood insulin levels (insulin resistance). In the 
next stage pancreatic cells become unable to produce sufficient amounts 
of insulin (diabetes type 2 or non insulin dependent diabetes mellitus 
NIDDM). 

The concept of 'metabolic syndrome' (syndrome x, insulin-resistance 
syndrome, deadly quartet) was first described 1966 by Camus and 
reintroduced 1988 by Reaven (Camus JP, 1966, Rev Rhum Mai Osteoartic 
33(1):10-14; Reaven et al. 1988, Diabetes, 37(1 2):1 595-1 607). Today 
metabolic syndrome is commonly defined as clustering of cardiovascular 
risk factors like hypertension, abdominal obesity, high blood levels of 
triglycerides and fasting glucose as well as low blood levels of HDL 
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cholesterol. Insulin resistance greatly Increases the.risk of developing the 
metabolic syndrome <Reaven, 2002. Circulation 106(3): 286-288) The 
m«abo«c syndrome often precedes the development of type II diabetes 
and cardiovascular disease (IWcCook. 2002. JAMA 288:2709-2716) The 
. control of blood lipid levels and blood glucose levels is the essential for the 
treatment of the metabolic syndrome (see. for example. Santomauro A T 
etal.. (1999) Diabetes. 48(9):1836-1841). 

The molecular factors regulating food intake and body weight balance are 
incompletely understood. Even i, several candidate genes have been 
descnbed which are supposed .o Influence the homeostatfo sy«em(s) that 
regulate body mass/weight, like leptln. VCPI. VCPL or the peroxisome 
prol,ferator-activa.ed receptor-gamma o.-activetor. the distinct molecular 
meohan«ms and/or molecules influencing obesity or body weigln/body 
mass regulations are not known. 

Secreted proteins are a major target for drug action and development. 
Accordingly. , is valuable to the field of pharmaceutical development to 
.dentrtV and characterize novel functions for secreted proteins. The present 
mvention advances the state of the art by providing previously unknown 
functions for human secreted proteins. 

Accordingly, the present invention relates to a secreted protein with novel 
functions in the human metabolism. 7„e present invention discloses 
specif,c genes and proteins encoded thereby invoh,ed In the regulation of 
me^bolism. especially in «,e disorders ofth. metabolic syndrome, obesity, 
diabetes mellltus. coronary heart disease, eating disorder, cachexia 
hypenension. hypercholesterolemia (dyslipldemia). and/or gallstones. ' 

Therefore, the technical problem underlying the present invention was to 
provKie for means and methods for modulating (pathological) metabolic 
condrtions influencing body-weight regulation and/or energy homeostatic 
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circuits. The solution to said technical problem is achieved by providing the 
embodiments characterized in the claims. 

Accordingly, the present invention relates to genes with novel functions in 
5 body-weight regulation, energy homeostasis, metabolism, and obesity. The 
present invention discloses specific genes involved in the regulation of 
body-weight, energy homeostasis, metabolism, and obesity, as well as 
related diseases such as diabetes mellitus, eating disorder, cachexia, 
hypertension, coronary heart disease, hypercholesterolemia (dyslipidemia), 

10 and / or gallstones. The present invention describes saposin-related genes, 
their homologous genes and proteins encoded thereby. In particular human 
saposin-related genes and proteins (also referred to as prosaposin, variant 
Gaucher disease and variant metachromatic leukodystrophy; PSAP; human 
pulmonary surfactant-associated protein B (SFTPB); human hypothetical 

15 protein FLJ40379), respectively, as being involved in the conditions 
mentioned above. 

So far, it has not been described that the proteins of the invention and 
homologous proteins are involved in the regulation of energy homeostasis 
20 and body-weight regulation and related disorders, and thus, no functions fn 
metabolic diseases and dysfunctions and other diseases as listed above 
have been discussed. 



25 



30 



In this invention we' refer to the proteins encoded by Drosophila 
Saposin-related genes and homologous proteins, preferably human and 
murine homologous polypeptides or proteins or sequences encoding these 
proteins as saposin-related or as proteins of the invention. 

The present invention discloses that saposin-related proteins are regulating 
the energy homeostasis and fat metabolism especially the metabolism and 
storage of triglycerides and glycogen, and polynucleotides, which Identify 
and encode the proteins disclosed in this invention. The Invention also 
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relates to vectors, host cells, antibodies, and recombinant methods for 

producins the polypeRtides and polynucleotides Of the invention The 
-nvention also relates to the use of «,ese polynucleotides, polypeptides and 
effectors «,ereof in the diagnosis, study, prevention, and tr.atn»nt of 
metabolic diseases or dysfunctions, for example, but not limited to 
metabolic syndrome, obesity, diabetes mellitus. eatinB disorder, cachexia' 
hypertension, coronary heart disease, hypercholesterolemia (dysllpldemla)' 
and / or gallstones. 

Saposin-related homologous -proteins and nucleic acid molecules coding 
«.erefore are obtainable from insect or vertebrate species, e.g. mammals or 
birds. Particularly preferred are nucleic acids encoding «,e human 
aaposin^elated homologs and «,e proteins encoded then,by fin particular 
human prosaposin (PSAP), human pulmonary surfactant-associated protein 
B (SFTPB). and/or human hypotheticel protein FU40379). 

The glycoprotein Prosaposin (PSAP) is a precursor for four saposlns (A B 
C, and D) found in different cellular locations (Morimoto S. et al (1989)' 
Proc. Nat. Acad. Scl. 86: 3389-3393). Saposlns are small ly«,somel 
proteins that serve as activators of various lysosomal llpid-d.gradlna 
enzymes (Munford R.S. et al.. (1995) J. Lipid Res. 36(8): 1663-1663) 
Prosaposin occurs in various kinds of human secretory fluids such as 
cerebrospinal fluid, semen, milk, pancreatic Juice, and bile. Prosaposin is 
postulated to mediate neurotrophic signeUng «,ents capable of Indueino 
neural differentiation and preventing ceU death. 

A mutant mouse line in which the sphingolipid activator protein gene has 
been inacSvated by homologous recombination technology showed 
complex clinical, pathologic, end biochemloal abnormalities similar to those 
of human patients with total saposin deficiency (Fujita N. et al., (1996) 
Hum Mol Genet. 5(6):711-725.). The main pathology in the brain of 
affected Sap -/- mice was hypomyelination and storage of ceramldes and 
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gangliosides. Ceramide storage occurred atso in brain, liver, and kidney, 
and ceramide catabolism is abnormally slow in fibroblasts. Examination of 
reproductive organs In prosaposin homozygous mutant males shows 
several abnormalities, indicating that prosaposin is also involved in the 
development and maintenance of male reproductive organs (Morales C.R. 
et al., (2000) J Androl 21(6):765-775) 

Pulmonary surfactant is a lipid-rich material that prevents lung collapse by 
lowering surface tension at the air-liquid interface in the alveoli of lung. It 
is composed primarily of phospholipids, cholesterol and proteins, including 
four surfactant associated proteins, two collagenous, carbohydrate-binding 
glycoproteins (PSP-A and PSP-D), and two small hydrophobic proteins 
(PSP-B and PSP-C). Pulmonary surfactant proteins are involved In 
respiration, and promote alveolar stability by lowering the surface tension 
at the air-liquid interface in the peripheral air spaces. PSP-B is a clinically 
important, developmentally regulated gene. Deficiency of PSPB B was 
demonstrated in congenital alveolar proteinosis (Nogee et al., (1993) N 
Engl J Med. 328(6) :406-410). 



The Invention particularly relates to a nucleic acid molecule encoding a 
polypeptide contributing to regulating the energy homeostasis and the 
metabolism of triglycerides and glycogen, wherein said nucleic acid 
molecule comprises 

(a) the nucleotide sequence of the Drosophila saposin-related gene or a 
mammalian, e.g. human saposin-related homolog (in particular 
human PSAP, SFTPB and/or FU40379), and/or a sequence 
complementary thereto, 

(b) a nucleotide sequence which hybridizes at 50*'C in a solution 
containing 1 x SSC and 0.1% SDS to a sequence of (a), 

(c) a sequence corresponding to the sequences of (a) or (b) within the 
degeneration of the genetic code. 



(d) 



(e) 



a sequence which encodes a polypeptide which is at least 85o/o 
preferably at least 90o^, more preferably at least 95o/o, more' 
preferably at least 98o/o and up to 99,6% identical to the amino acid 
sequences of a saposin-related protein, preferably a mammalian, 
e.g. human saposin-related homolog (in particular PSAP, SFTPB 
and/or FLJ40379), 

a sequence which differs from the nucleic acid molecule of (a) to (d) 
by mutation and wherein said mutation causes an alteration 
deletion, duplication and/or premature stop in the encoded 
polypeptide or • 

a partial sequence of any of the nucleotide sequences of (a) to (e) 
having a length of at least 15 bases, preferably at least 20 bases 
more preferably at least 25 bases and most preferably at least 50 
bases. 

The present invention relates to genes with novel functions in body-welght 
regulation, energy homeostasis, metabolism, and obesity, fragments of 
said genes, polypeptides encoded by said genes or fragments thereof, and 
effectors e.g. antibodies, biologically active nucleic acids, such as 
antisense molecules, RNAi molecules or ribozymes, aptamers, peptides or 
low-molecular weight organic compounds recognizing said polynucleotides 
or polypeptides. 



(f) 



The ability to manipulate and screen tiie genomes of model organisms such 
as the fly Drosophila melanogaster provides a powerful tool to analyze 
biological and biochemical processes that have direct relevance to more 
complex vertebrate organisms due to significant evolutionary conservation 
of genes, cellular processes, and pathways (see, for example, Adams M 
D. et al., (2000) Science 287: 2185-2195). Identification of novel gene 
functions In model organisms can directly contribute to the elucidation of 
correlative pathways in mammals (humans) and of methods of modulating 
them. A correlation between a pathology model (such as changes In 



triglyceride levels as indication for metabolic syndrome including obesity) 
and the modified expression of a fly gene can identify the association of 
the human ortholog with the particular human disease. 

In one embodiment, a forward genetic screen is performed in fly displaying 
a mutant phenotype due to misexpression of a known gene (see, Johnston 
Nat Rev Genet 3: 176-188 (2002); Rorth P., (1996) Proc Natl Acad Sci U 
S A 93: 12418-12422). In this invention, we have used a genetic screen 
to identify mutations of Saposin-related homologous genes that cause 
changes in the body weight, which are reflected by a significant change of 
triglyceride levels. Additionally glycogen levels are analysed. 

One resource for screening was a Drosophila melanogaster stock collection 
of EP-lines. The P-vector of this collection has Gal4-UAS-binding sites 
fused to a basal promoter that can transcribe adjacent genomic Drosophila 
sequences upon binding of Gal4 to UAS-sites (Brand & Perrimon (1993) 
Development 118:401-415; Rorth P., supra). This enables the EP-line 
collection for overexpression of endogenous flanking gene sequences. In 
addition, without activation of the UAS-sites, integration of the EP-element 
into the gene is likely to cause a reduction of gene activity, and allows 
determining its function by evaluating the loss-of-function phenotype. 

To isolate genes with a function in energy homeostasis, several thousand 
EP-lines were tested' for their triglyceride/glycogen content after a 
prolonged feeding period (see Examples for more detail). Lines with 
significantly changed triglyceride/glycogen content were selected as 
positive candidates for further analysis. The change of 
triglyceride/glycogen content due to the loss of a gene function suggests 
gene activities in energy homeostasis in a dose dependent manner that 
control the amount of energy stored as triglycerides or glycogen. 
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In this invention, the content of triglycerides and glycogen of a pool of flies 
with the same genotype was analyzed after feeding for six days using a 
triglyceride and a glycogen assay. Male flies homozygous for the 
integration of vectors for Drosophila lines HD.EP(3)36824 were analyzed in 
assays measuring the triglyceride/glycogen contents of these flies 
(Illustrated in more detail in the Examples section). The results of the 
triglyceride/glycogen content analysis are shown in Figure 1. 

Genomic DNA sequences were Isolated that are localized directly adjacent 
to. the EP vector (herein HD.EP(3)36824) integration. Using those isolated 
genomic sequences public databases like Berkeley Drosophila Genome 
Project (GadRy; see also FlyBase (1 999) Nucleic Acids Research 27:85-88) 
were screened thereby identifying the integration site of the vectors, and 
the corresponding gene, described in more detail in the Examples section. 
The molecular organization of the gene is shown in Figures 2. 

The Drosophila genes and proteins encoded thereby with functions in the 
regulation of triglyceride metabolism were further analysed in publicly 
available sequence databases (see Eamples for more detail) and mammalian 
homoiogs were identified (see Figure 3). 

In addition, we identified prosaposm (PSAP) as secreted factor expressed 
.n developing mobse pancreas, as described in more detail in the Examples 
section. 

The function of the mammalian homoiogs In energy homeostasis was 
further validated in this invention by analyzing the expression of the 
transcripts In different tissues and by analyzing the role in adipocyte 
differentiation. 

Expression profiling studies (see Examples for more detail) confirm the 
particular relevance of the protein of the invention as regulators of energy 
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metabolism in mammals. Transcripts of prosaposin are found in several 
tissues of mammals with high expression levels in brain tissues 
(hypothalamus), in kidney, heart and spleen. In addition, prosaposin shows 
high expression in brown adipose tissue (BAT) and white adipose tissue 
(WAT) (see FIGURE 4A)- Brown adipose tissue is a well characterized 
tissue which is well developed in newborn mammals, including humans. 
One important task of BAT is to generate heat and maintain body 
temperature homeostasis in newborn. Thus, an expression of the protein of 
the invention in adipose tissues is confirming a role in the regulation of 
energy homeostasis and thermogenesis. 

Further, we show that mammalian prosaposin is regulated by fasting and 
by genetically induced obesity. In this invention, we used mouse models of 
insulin resistance and/or diabetes, such as mice carrying gene knockouts in 
the leptin pathway (for example, ob (leptin) or db (leptin receptor) mice) to 
study the expression of prosaposin. Such mice develop typical symptoms 
of diabetes, show hepatic lipid accumulation and frequently have increased 
plasma lipid levels (see Bruning et al, 1998, Mol. Cell. 2:449-569). We 
found, for example, that the expression of prosaposin is strongly 
upregulated in liver of fasted and ob/ob mice (see Figure 4B). In addition, 
a marked upregulation can be observed In the metabolically active tissue 
(for example, white adipose tissue (WAT)) of genetically obese (ob/ob) 
(see FIGURE 4B). 

Susceptible wild type mice (for example C57BI/6) show symptoms of 
diabetes, lipid accumulation, and high plasma lipid levels, if fed a high fat 
diet. In such mice, the most prominent response with regard to 
metabolically active tissues was observerd. In those mice, the expression 
of prosaposin is significantly enhanced in white adipose tissue (see Figure 
4C) and in liver and muscle tissues, supporting a hypothesis that the 
protein of the Invention is a modulator of adipogenesis. 



In addition, we show in this invention that the the protein of the invention 
mRNA Is up-regulated during adipocyte differentiation in vitro (see 
EXAMPLES for more detail), suggesting a role as modulator of adipocyte 
lipid accumulation. With regard to changes In expression Intensity during 
the differentiation of preadipocytes to adipocytes, a enhancement In 
relative signal intensity can be observed for the protein of the Invention 
during the In vitro differentiation program of 3T3-L1 (see Figure 4D). Thus, 
we conclude tiiat prosaposin or variants or processing products thereof 
have a function In the metabolism of mature adipocytes. 

Microarrays are analytical tools routinely used in bloanalysis. A microarray 
has molecules distributed over, and stably associated with, the surface of 
a solid support. The term "microarray" refers to an arrangement of a 
plurality of polynucleotides, polypeptides, antibodies, or other chemical 
compounds on a substrate. Microarrays of polypeptides, polynucleotides, 
and/or antibodies have been developed and find use In a variety of 
applications, such as monitoring gene expression, drug discovery, gene 
sequencing, gene mapping, bacterial identification, and combinatorial 
chemistry. One area In particular in which microarrays find use is in gene 
expression analysis (see Example 4). Array technology can be used to 
explore the expression of a single polymorphic gene or the expression 
profile of a large number of related or unrelated genes. When the 
expression of a single gene is examined, arrays are employed to detect the 
expression of a specific gene or its variants. When an expression profile Is 
examined, arrays provide a platform for identifying genes that are tissue 
specific, are affected by a substance being tested In a toxicology assay, 
are part of a signaling cascade, carry out housekeeping functions, or are 
specifically related to a particular genetic predisposition, condition, disease, 
or disorder. 

Microarrays may be prepared, used, and analyzed using methods known in 
the art (see for example, Brennan, T.M. et al. (1995) U.S. Patent No. 



5,474,796, Schena, M. et al. (1996) Proc. Natl. Acad. Sci. USA 
93:10614-10619; Baldeschweiler et al. {1995) PCT application 
W095/251116; Shalon. D. et al. (1995) PCT application WO95/35505; 
Heller, R.A. et al. (1 997) Proc. Natl. Acad. Sci. USA 94:21502155; Heller, 
M.J. et al. (1997) U.S. Patent No. 5,605,662). Various types of 
microarrays are well known and thoroughly described in Schena, M., ed. 
(1999; DNA Microarrays: A Practical Approach, Oxford University Press, 
London). 

In further embodiments, oligonucleotides or longer fragments derived from 
any of the polynucleotides described herein may be used as elements on a 
microarray. The microarray can be used in transcript imaging techniques 
which monitor the relative expression levels of large numbers of genes 
simultaneously as described below. The microarray may also be used to 
identify genetic variants, mutations, and polymorphisms. This information 
may be used to determine gene function, to understand the genetic basis 
of a disorder, to diagnose a disorder, to monitor progression/regression of 
disease as a function of gene expression, and to develop and monitor the 
activities of therapeutic agents in the treatment of disease. In particular, 
this information may be used to develop a pharmacogenomic profile of a 
patient in order to select the most appropriate and effective treatment 
regimen for that patient. For example, therapeutic agents which are highly 
effective and display the fewest side effects may be selected for a patient 
based on his/her pharmacogenomic profile. 

As determined by Microarray analysis, prosaposin shows differential 
expression in human primary adipocytes. A strong downregulation is 
observed concerning the expression of prosaposin is during the human 
adipocyte differentiation (see FIGURE 5). The prosaposin protein in 
preadipocyctes has the potential to inhibit adipose differentiation at a very 
early stage. Therefore, the prosaposin protein might play an essential role 
in adipogenesis. The results are suggesting a role of prosaposin in the 
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.nh.b.,or. „, aCpogenesi,. Thus, prosaposin i, a strong candidate for a 
Pharmac«.tica. composition for the treatment of conditions related to 
human metabolism, such as obesity, diabetes, and/or metabolic syndrome. 

The invention also encompasses polynucieotides that encode the proteins 
Of the mven,,on and homoiogous proteins. Accordingly, any nuciL acid 
sequence, which encodes the amino acid sequences of «,e proteins ,1 
.nv nt,„„ 3„d homologous proteins, can be used to generate recombinant 
mo ecules that express the ,rot.ins of the invention and homologous 
Prote,ns. In a particular embodiment the inven«on encompasses a nuLc 
ac^ encod,ng orosophila sapos.n-rela.ed. or their mammalian, e.g. human 
homologs; referred to herein as the proteins of the invention. ,. wil, be 
apprecated by d,ose sWilod in the .rt«,a. as a result of the degeneracy of 
the genefc code, a muWtude of nucleotide sequences encoding the 
prote,ne. some bearing minimal homology to the nucleotide sequences of 
any Known and naturally occurring gene, may be produced. The invention 

contempleteseachandevery possible vanationofnucleotidesequence that 
can be made by setecting combinations based on possible codon choices. 

Also encompassed by the invention are polynucleotide sequences tt,at ar« 
capabi. Of hybridizing to the claimed nuc,eo«de sequences. Id ,n 
parfcular. those of the polynucleotide encoding «,e proteins of the 
■nvenfon under variou. conditions of stringency. Hybridization conditions 
are based on the melting tempe^ture ,Tm, of the nuCeic add binding 
complex or probe, as taught in Wahl «. Berger (1987: Mett,od. Enzymol 
152:399-407, and Kimme. ,1987; Methods Enzymol. , 52:507-5, „ Id 
may be used at a defined stringency. Preferably, hybrWization Jnder 
str-ngent cond«ions means «,at after washing for 1 h with 1 x SSC and 
0.1 * SOS a. SO-C, preferably at 55«C. more preferably at 62oc and most 
preferably at 65-0. particulariy for , h in 0.2 x SSC and 0.1% SOS a 
50 C. preferably at 55-C. more preferably at 62»C and most preferably at 



65 °C, a positive hybridization signal is observed. Altered nucleic acid 
sequences encoding the proteins which are encompassed by the invention 
include deletions, insertions or substitutions of different nucleotides 
resulting in a polynucleotide that encodes the same or a functionally 
equivalent protein. 

The encoded proteins may also contain deletions, insertions or 
substitutions of amino acid residues, which produce a silent change and 
result in functionally equivalent proteins. Deliberate amino acid 
substitutions may be made on the basis of similarity in polarity, charge, 
solubility, hydrophobicity, hydrophiliclty, and/or the amphipathic nature of 
the residues as long as the biological activity of the protein is retained. 
Furthermore, the invention relates to peptide fragments of the proteins or 
derivatives thereof such as cyclic peptides, retro-inverso peptides or 
peptide mimetics having a length of at least 4, preferably at least 6 and up 
to 50 amino acids. 

Also included within the scope of the present invention are alleles of the 
genes encoding the proteins of the invention and homologous proteins. As 
used herein, an 'allele' or 'allelic sequence' is an alternative form of the 
gene, which may result from at least one mutation in the nucleic acid 
sequence. Alleles may result in altered mRNAs or polypeptides whose 
structures or function may or may not be altered. Any given gene may 
have none, one or many allelic forms. Common mutational changes, which 
give rise to alleles, are generally ascribed to natural deletions, additions or 
substitutions of nucleotides. Each of these types of changes may occur 
alone or in combination with the others, one or more times In a given 
sequence. 

The nucleic acid sequences encoding the proteins of the invention and 
homologous proteins may be extended utilizing a partial nucleotide 



sequence and employing various methods known in the art to 
upstream sequences such as promoters and regulatory elements. 



In order to express a biologically active protein, the nucleotide sequences 
encoding the proteins or functional equivalents, may be inserted into 
appropriate expression vectors, i.e., a vector which contains the necessary 
elements for the transcription and translation of the inserted coding 
sequence. Methods, which are well known to those skilled in the art, may 
be used to construct expression vectors containing sequences encoding 
the proteins and the appropriate transcriptional and translational control 
elements. Regulatory elements Include for example a promoter, an Initiation 
codon, a stop codon, a mRNA stability regulatory element, and a 
polyadenylation signal. Expression of a polynucleotide can be assured by (1) 
constitutive promoters such as the Cytomegalovirus (CMV) 
promoter/enhancer region, (ii) tissue specific promoters such as the insulin 
promoter {see, Soria et al., 2000, Diabetes 49:1 57), SOX2 gene promoter 
(see Li et al., (1998) Curr. Biol. 8:971-974), Msi-1 promoter (see 
Sakakibara et al., (1997) J. Neuroscience 17:8300-8312), alpha-cardia 
myosin heavy chain promoter or human atrial natriuretic factor promoter 
(Klug et al., (1996) J. din. Invest 98:216-224; Wu et al., (1989) J. Biol 
Chem. 264:6472-6479) or (iii) inducible promoters such as the tetracycline 
inducible system. Expression vectors can also contain a selection agent or 
marker gene that confers antibiotic resistance such as the neomycin, 
hygromycin or puromycin resistance genes. These methods include in vitro 
recombinant DNA techniques, synthetic techniques, and in vivo genetic 
recombination. Such techniques are described in Sambrook, J. et al. 
(1 989) Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Press, 
Plainview, N.Y. and Ausubel, P.M. et al. (1989) Current Protocols iri 
Molecular Biology, John Wiley & Sons, New York, N.Y. 

In a further embodiment of the invention, natural, modified or recombinant 
nucleic acid sequences encoding the proteins of the invention and 
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homologous proteins may be llgated to a heterologous sequence to encode 
a fusion protein. 

A variety of expression vector/liost systems may be utilized to contain and 
express sequences encoding the proteins or fusion proteins. These include, 
but are not limited to, micro-organisms such as bacteria transformed with 
recombinant bacteriophage, plasmid or cosmid DNA expression vectors; 
yeast transformed with yeast expression vectors; Insect cell systems 
infected with virus expression vectors (e.g., baculovirus, adenovirus, 
adeno-associated virus, lentiverus. retrovirus); plant cell systems 
transformed with virus expression vectors (e.g., cauliflower mosaic virus, 
CaMV; tobacco mosaic virus, TMV) or with bacterial expression vectors 
(e.g., Ti or PBR322 plasmids); or animal cell systems. 

The presence of polynucleotide sequences of the invention in a sample can 
be detected by DNA-DNA or DNA-RNA hybridization and/or amplification 
using probes or portions or fragments of said polynucleotides. Nucleic acid 
amplification based assays involve the use of oligonucleotides or oligomers 
based on the sequences specific for the gene to detect transformants 
containing DNA or RNA encoding the corresponding protein. As used 
herein 'oligonucleotides' or 'oligomers' refer to a nucleic acid sequence of 
at least about 10 nucleotides and as many as about 60 nucleotides, 
preferably about 1 5 to 30 nucleotides, and more preferably about 20-25 
nucleotides, which can be used as a probe or amplimer. 

A wide variety of labels and conjugation techniques are known by those 
skilled In the art and may be used In various nucleic acid and amino acid 
assays. Means for producing labeled hybridization or PGR probes for 
detecting polynucleotide sequences include oligo-labeling, nick translation, 
end-labeling of labeled RNA probes, PGR amplification using a labeled 
nucleotide, or enzymatic synthesis. These procedures may be conducted 
using a variety of commercially available kits (Pharmacia & Upjohn, 



(Kalamazoo, Mich.); Promega (Madison Wis.); and U.S. Biochemical Coro 
(Cleveland, Ohio). 

The presence of proteins of the invention in a sample can be determined by 
.mmunological methods or activity measurement. A variety of protocols for 
detecting and measuring the expression of proteins, using either polyclonal 
or monoclonal antibodies specific for the protein or reagents for 
determining protein activity are known in the art. Examples include 
enzyme-lmked immunosorbent assay (ELiSA), radioimmunoassay (RIA) and 
fluorescence activated cell sorting (FACS). A two-site, monoclonal-based 
immunoassay utilizing monoclonal antibodies reactive to two 
non-interfering epitopes on the protein Is preferred, but a competitive 
binding assay may be employed. These and other assays are described 
among other places, in Hampton, R. et al. (1990; Serological Methods a' 
Laboratory Manual, APS Press, St Paul, Minn.) and Maddox, D. E. et al. 
(1983; J. Exp. Med. 158:1211-1216). 

Suitable reporter molecules or labels, which may be used. Include 
radionuclide,, enzymes, fluorescent, chemilumlnescent or chromogenic 
agents as well as substrates, co-factors, inhibitors, magnetic particles, and 

the like. 

The nucleic acids encoding the proteins of the invention can be used to 
generate transgenic anfmal or site specific gene modifications in cell lines 
Transgenic animals may be made through homologous recombination 
where the nomial locus of the genes encoding the proteins of the Inventlori 
IS altered. Alternatively, a nucleic acid construct is randomly integrated into 
the genome. Vectors for stable integration include plasmids. retrovirusses 
and other animal virusses, YACs, and the like. The modified cells or animal 
are useful in the study of the function and regulation of the proteins of the 
invention. For example, a series of small deletions and/or substitutions may 
be made In the genes that encode the proteins of the invention to 



determine the role of particular domains of the protein, functions in 
pancreatic differentiation, functions in adipocyte differentiation, etc. 



Specific constructs of interest include anti-sense molecules, which will 
block the expression of the proteins of the invention, or expression of 
dominant negative mutations. A detectable marker, such as for example 
lac-Z, may be introduced in the locus of the genes of the invention, where 
upregulation of expression of the genes of the invention will result in an 
easily detected change in phenotype. 

One may also provide for expression of the genes of the invention or 
variants thereof in cells or tissues where it is not normally expressed or at 
abnormal times of development. In addition, by providing expression of the 
proteins of the invention in cells In which they are not normally produced, 
one can induce changes in cell behavior. 

DNA constructs for homologous recombination will comprise at least 
portions of the genes of the invention with the desired genetic 
modification, and will include regions of homology to the target locus. DNA 
constructs for random integration need not include regions of homology to 
mediate recombination. Conveniently, markers for positive and/or negative 
selection are included. Methods for generating cells having targeted gene 
modifications through homologous recombination are known in the art. For 
embryonic stem (ES) cells, an ES cell line may be employed, or embryonic 
cells may be obtained freshly from a host, e.g. mouse, rat, guinea pig etc. 
Such cells are grown on an appropriate fibroblast-feeder layer or grown in 
presence of leukemia inhibiting factor (LIF). 

When ES or embryonic cells or somatic pluripotent stem cells have been 
transformed, they may be used to produce transgenic animals. After 
transformation, the cells are plated onto a feeder layer in an appropriate 
medium. Cells containing the construct may be detected by employing a 



selective medium. After sufficient time for colonies to grow they ere 
picked end analyzed for the occurrence of homoloflous recombination or 
.ntegration of the construct. Those colonies that are positive may then be 
used tor embryo manipulation and blastocyst injection. Blastocysts are 
obtained from 4 to 6 v,eek old superovulated females. The ES cells are 
trypslnlzed, end the modified cells ere injected into the blastocoel of the 
blastocyst. After injection, the blastocysts are returned to each uterine 
hom of pseudopregnent females. Females are then allowed to go to term 
end the resulting offspring screened for the construct. By providing for a 
d^erent phenotype of the blastocyst and the genetically modified ceHs 
ch,meric progeny can be readily detected. The ohimarlo animals are 
screened for the presence of the modified gene and males and females 
having the modMcetion are mated to produce homozygous progeny. If 
gene elterationa cause lethality at some point in development, tissues or 
organs can be maintained as allogenio or congenic grafts or transplants or 
». vitro culture. The trensgenic animals may be any non-human mammal 
such as laboratory animal, domestic animals, etc. The transgenic animals 
may be used In functional studies, drug screening, etc. 



Diagnostics and Therapeutics 

The data disclosed in this invention show that the nucleic acids and 
proteins of the invention and effector molecules thereof are useful in 
diagnostic and therapeutic applications implicated, for example, but no. 
hmited to. metabolic syndrome including obesitv, diabetes mellitus, eating 
disorder, cachexia, hypertension, coronary heart disease 
hypercholesterolemia (dysiipidamia). end / or gallstones. Hence, diagnostic 
and therapeutic uses for the proteins of the invantion nucleic acids and 
proteins of the invention are, for example but not limited to, the following: 
0 protein therapeutic, (ii) small molecul. drug terge,. (iii, antibody target 
(therepeutio. diagnostic, drug targeting/cytotoxic antibody), (iv) diagnostic 
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and/or prognostic marker, |v) gene therapy {gene delivery/gene ablation), 
(vi) research tools, and (vii) tissue regeneration in vitro and in vivo 
(regeneration for all these tissues and cell types composing these tissues 
and cell types derived from these tissues). 

The nucleic acids and proteins of the invention and effectors thereof are 
useful in diagnostic and therapeutic applications implicated in various 
applications as described below. For example, but not limited to, cDNAs 
encoding the proteins of the invention and particularly their human 
homologues may be useful Jn gene therapy, and the proteins of the 
invention and particularly their human homologues may be useful when 
administered to a subject in need thereof. By way of non-limiting example, 
the compositions of the present invention will have efficacy for treatment 
of patients suffering from, for example, but not limited to, in metabolic 
disorders as described above. 

The nucleic acids of the invention or fragments thereof, may further be 
useful in diagnostic applications, wherein the presence or amount of the 
nucleic acids or the proteins are to be assessed. Further antibodies that 
bind immunospecifically to the novel substances of the invention may be 
used in therapeutic or diagnostic methods. 

For example, in one aspect, antibodies, which are specific for the proteins 
of the invention and homologous proteins, may be used directly as an 
effector, e.g. an antagonist or indirectly as a targeting or delivery 
mechanism for bringing a pharmaceutical agent to cells or tissue which 
express the protein. The antibodies may be generated using methods that 
are well known in the art. Such antibodies may include, but are not limited 
to, polyclonal, monoclonal, chimeric single chain. Fab fragments, and 
fragments produced by a Fab expression library. Neutralising antibodies, 
(i.e., those which inhibit dimer formation) are especially preferred for 
therapeutic use. 
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For the production of antibodies, various hosts including goats rabbits 
rats, mice, humans, and others, may be immunized by injection with the 
protein or any fragment or oligopeptide thereof which has immunogenic 
properties. Depending on the host species, various adjuvants may be used 
to increase immunological response. It is preferred that the peptides 
fragments or oligopeptides used to induce antibodies to the protein have an 
ammo acid sequence consisting of at least five amino acids, and more 
preferably at least 10 amino acids. 

Monoclonal antibodies to t^e proteins may be prepared using any 
technique that provides for the production of antibody molecules by 
continuous cell lines in culture. These Include, but are not limited to the 
hybndoma technique, the human B-cell hybrldoma technique, and the 
EBV-hybridoma technique (Kohler, G. et al. (1975) Nature 256:495-497. 

Kozbor, D.etal. (1985) J. Immunol. Methods 81:31-42; Cote R J etal' 
Proc. Natl. Acad. Scl. 80:2026-2030; Cole, S. P. et al. (1984) Mol. CeH 
Biol. 62:108-120). 

In addition, teohnlques developed for the production of -cl,imerio 
antibodies-, the splioinfl of mouse antibody aenes to human antibody genes 
to Obtain a molecule with appropriate antigen speoificitv and biological 
aotn,ity can be used (Morrison, S. L. et al. (1984) Prco. Na«. Acad Sci 
81:6851-6865; Neuberger, IVI. S. et al (1984) Nature 312:604-608- 
Talceda, S. at al. (1985) Nature 314:452-454). Alternatively, technique 
desonbed for the production of single chain antibodies may be adapted 
using methods known in the art. to produce single chain antibodies specific 
for the proteins of the invention and homologous proteins. Antibodies wi«, 
related specificity, but of distinct Idiotypic composition, may be generated 
by chain Shuffling from random combinatorial immunoglobulin libraries 
(Burton. D. R. ,1991) i=roc. Natl. Acad. Scl. 88:1 1 120-1 1 123). Antibodies 
may also be produced by inducing in vivo production in the lymphocyte 
population or by screening recombinant immunoglobulin libraries or panels 
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of htghly specific binding reagents as disclosed In the literature (Orlandl, R. 
etal. <1989) Pi-oc. Natl. Acad. Sci. 86:3833-3837; Winter, G. et ai. (1991) 
Nature 349:293-299). 

Antibody fragments, which contain specific binding sites for the proteins 
may also be generated. For example, such fragments include, but are not 

ft 

limited to, the F(ab')2 fragments which can be produced by Pepsin 
digestion of the antibody molecule and the Fab fragments which can be 
generated by reducing the disulfide bridges of F{ab')2 fragments. 
Alternatively, Fab expression- libraries may be constructed to allow rapid 
and easy identification of monoclonal Fab fragments with the desired 
specificity (Huse, W. D. et al. (1989) Science 254:1275-1281). 

Various immunoassays may be used for screening to identify antibodies 
having the desired specificity. Numerous protocols for competitive binding 
and immunoradfometric assays using either polyclonal or monoclonal 
antibodies with established specificities are well known in the art. Such 
immunoassays typically involve the measurement of complex formation 
between the protein and its specific antibody. A two-site, 
monoclonal-based immunoassay utilizing monoclonal antibodies reacive to 
two non-interfering protein epitopes are preferred, but a competitive 
binding assay may also be employed (Maddox, supra). 

In another embodiment of the invention, the polynucleotides or fragments 
tiiereof or nucleic acid effector molecules such as antisense molecules, 
aptamers, RNAi molecules or ribozymes may be used for therapeutic 
purposes. In one aspect, aptamers. I.e. nucleic acid molecules, which are 
capable of binding to a protein of the invention and modulating Its activity, 
may be generated by a screening and selection procedure involving the use 
of combinatorial nucleic acid libraries. 



in a further aspect, antisense rnolecules may be used in situatioris in which 
It would be desirable to block the transcription of the mRNA. In particular, 
cells may be transformed with sequences complementary to 
polynucleotides encoding the proteins of the invention and homologous 
proteins. Thus, antisense molecules may be used to modulate protein 
activity or to achieve regulation of gene function. Such technology is now 
well known in the art, and sense or antisense oligomers or larger 
fragments, can be designed from various locations along the coding or 
control regions of sequences encoding the proteins. Expression vectors 
derived from retroviruses, adenovirus, herpes or vaccinia viruses or from 
various bacterial plasmids may be used for delivery of nucleotide 
sequences to the targeted organ, tissue or cell population. Methods, which 
are well known to those skilled in the art, can be used to construct 
recombinant vectors, which will express antisense molecules 
complementary to the polynucleotides of the genes encoding the proteins 
of the invention and homologous proteins. These techniques are described 
both in Sambrook et al. (supra) and in Ausubel et al. (supra). Genes 
encoding the proteins of the invention and homologous proteins can be 
turned off by transforming a cell or tissue with expression vectors, which 
express high levels of polynucleotides that encode the proteins of the 
invention and homologous proteins or fragments thereof. Such constructs 
may be used to introduce untranslatable sense or antisense sequences into 
a cell. Even in the absence of integration Into the DNA, such vectors may 
continue to transcribe RNA molecules until they are disabled by 
endogenous nucleases. Transient expression may last for a month or more 
with a non-replicating vector and even longer if appropriate replication 
elements are part of the vector system. 

As mentioned above, modifications of gene expression can be obtained by 
designing antisense molecules, e.g. DNA, RNA or nucleic acid analogues 
such as PNA, to the control regions of the genes encoding the proteins of 
the invention and homologous proteins, i.e., the promoters, enhancers, and 
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introns. Oligonucleotides derived from the transcription initiation site, e.g., 
between positions -1 0 and + 10 from the start site, are preferred. Similarly, 
inhibition can be achieved using "triple helix" base-pairing methodology. 
Triple helix pairing is useful because it cause inhibition of the ability of the 
double helix to open sufficiently for the binding of polymerases, 
transcription factors or regulatory molecules. Recent therapeutic advances 
using triplex DNA have been described in the literature (Gee, J. E. et al. 
(1994) In; Huber, B. E. and B. I. Carr, Molecular and Immunologic 
Approaches, Future Publishing Co., Mt. Kisco, N.Y.). The antisense 
molecules may also be designed to block translation of mRNA by 
preventing the transcript from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the 
specific cleavage of RNA. The mechanism of ribozyme action involves 
sequence-specific hybridization of the ribozyme molecule to complementary 
target RNA, followed by endonucleolytic cleavage. Examples, which may 
be used, include engineered hammerhead motif ribozyme molecules that 
can be specifically and efficiently catalyze endonucleolytic cleavage of 
sequences encoding the proteins of the invention and homologous 
proteins. Specific ribozyme cleavage sites within any potential RNA target 
are initially identified by scanning the target molecule for ribozyme 
cleavage sites which include the following sequences: QUA, QUU, and 
GUC. Once identified, short RNA sequences of between 15 and 20 
ribonucleotides corresponding to the region of the target gene containing 
the cleavage site may be evaluated for secondary structural features which 
may render the oligonucleotide inoperable. The suitability of candidate 
targets may also be evaluated by testing accessibility to hybridization with 
complementary oligonucleotides using ribonuclease protection assays. 

Nucleic acid effector molecules, e.g. antisense molecules and ribozymes of 
the invention may be prepared by any method known In the art for the 
synthesis of nucleic acid molecules. These include techniques for 
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chemically synthesizing ,. oliBonucleotides such as solid k 

Phosphoramidite chemical synthesis. Alternatively RNA m , 

generated .y In ^d in y,y„ t.anscn.tio„t; ol Z^JZ^, 

Huiymerase promoters such as T7 or SPfi ai+« ^. . 
cons.n,c.s that synthesize antisense RNA cll2 7 "'"'^ 
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thymine, and uridine which are not as easily r^c^^^^^TZ 
endonucleases. euognized by endogenous 



Many methods for introducing vectors into cells or ti« 

.,u.lly suitahie for use in .Vo. in vitro. JZZT^Z:: ZTZ' 

vectors may be introduced into stem cells taRen from * 

clonally propagated for autologous transplant barin™*« 

Delivery by transfectioh ,„h k ,• P^*- 

n-ethods. i^J:::::::^:^^^^ — 

described above mav h» , r T ' "»thods 

e.amp,e. mammairrarr ^ c~r!:ir '"^ 
and most preferably, humans. ' 



An additional embodiment of the Invention relates to th. «w • • 

a pharmaceutical composition In administration of 
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Such pharmaceutical compositions may consist of the nucleic acids and the 
proteins of the invention and homologous nucleic acids or proteins, 
antibodies to the proteins of the invention and homologous proteins, 
mimetics, agonists, antagonists or inhibitors of the proteins of the 
invention and homologous proteins or nucleic acids. The compositions may 
be administered alone or in combination with at least one other agent, such 
as stabilizing compound, which may be administered in any sterile, 
biocompatible pharmaceutical carrier, including, but not limited to, saline, 
buffered saline, dextrose, and water. The compositions may be 
administered to a patient alone or in combination with other agents, drugs 
or hormones. The pharmaceutical compositions utilized in this invention 
may be administered by any number of routes including, but not limited to, 
oral, intravenous, intramuscular, intra-arterial, intramedullary, intrathecal, 
intraventricular, transdermal, subcutaneous, intraperitoneal, intranasal, 
enteral, topical, sublingual or rectal means. 

In addition to the active ingredients, these pharmaceutical compositions 
may contain suitable pharmaceuticaliy-acceptable carriers comprising 
excipients and auxiliaries, which facilitate processing of the active 
compounds into preparations, which can be used pharmaceutically. Further 
details on techniques for formulation and administration may be found in 
the latest edition of Remington's Pharmaceutical Sciences <Maack 
Publishing Co., Easton, Pa.). 

Pharmaceutical compositions suitable for use in the invention include 
compositions wherein the active ingredients are contained in an effective 
amount to achieve the intended purpose. The determination of an effective 
dose is well within the capability of those skilled in the art. For any 
compound, the therapeutically effective dose can be estimated initially 
either In cell culture assays, e.g., of preadipocyte cell lines or in animal 
models, usually mice, rabbits, dogs or pigs. The animal model may also be 
used to determine the appropriate concentration range and route of 
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administration. Such information can then be used to determine useful 
doses and routes for administration in humans, A therapeutically effective 
dose refers to that amount of active ingredient, for example the nucleic 
acids or the proteins of the invention or fragments thereof or antibodies 
which is sufficient for treating a specific condition. Therapeutic efficacy 
and toxicity may be determined by standard pharmaceutical procedures in 
cell cultures or experimental animals, e.g., ED50 (the dose therapeutically 
effective in SQo/o of the population) and LD50 (the dose lethal to 50% of 
the population). The dose ratio between therapeutic and toxic effects Is the 
therapeutic index, and it can be expressed as the ratio. LD50/ED50 
Pharmaceutical compositions, which exhibit large therapeutic indices are 
preferred. The data obtained from cell culture assays and animal studies is 
used in formulating a range of dosage for human use. The dosage 
contained in such compositions is preferably within a range of circulating 
concentrations that Include the ED50 with little or no toxicity. The dosage 
varies within this range depending upon the dosage from employed 
sensitivity of the patient, and the route of administration. The exact dosage 
W.II be determined by the practitioner, in light of factors related to the 
subject that requires treatment. Dosage and administration are adjusted to 
provide sufficient levels of the active moiety or to maintain the desired 
effect. Factors, which may be taken Into account. Include the severity of 
the disease state, general health of the subject, age, weight, and gender of 
the subject, diet, time and frequency of administration, drug 
comb.nation(s), reaction sensitivities, and tolerance/response to therapy 
Long-acting pharmaceutical compositions may be administered every 3 to 
4 days, every week or once every two weeks depending on half-life and 
clearance rate of the particular formulation. Normal dosage amounts may 
vary from 0.1 to 100,000 ^g, up to a total dose of about 1 g, depending 
upon the route of administration. Guidance as to particular dosages and 
methods of delivery Is provided in the literature and generally available to 
practitioners in the art. Those skilled in the art employ different 
formulations for nucleotides than for proteins or their inhibitors. Similarly 
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delivery of polynucleotides or polypeptides will be specific to particular 
cells, conditions, locations, etc. 

In another embodiment, antibodies which specifically bind to the proteins 
may be used for the diagnosis of conditions or diseases characterized by or 
associated with over- or underexpression of the proteins of the invention 
and homologous proteins or in assays to monitor patients being treated 
with the proteins of the invention and homologous proteins, or effectors 
thereof, e.g. agonists, antagonists, or inhibitors. Diagnostic assays include 
methods which utilize the antibody and a label to detect the protein in 
human body fluids or extracts of cells or tissues. The antibodies may be 
used with or without modification, and may be labeled by joining them, 
either covalently or ncn-covalently, with a reporter molecule. A wide 
variety of reporter molecules, which are Icnown In the art may be used 
several of which are described above. 

A variety of protocols including ELISA, RIA, and FACS for measuring 
proteins are known in the art and provide a basis for diagnosing altered or 
abnormal levels of gene expression. Normal or standard values for gene 
expression are established by combining body fluids or cell extracts taken 
from normal mammalian subjects, preferably human, with antibodies to the 
protein under conditions suitable for complex formation. The amount of 
standard complex formation may be quantified by various methods, but 
preferably by photometric means. Quantities of protein expressed in control 
and disease, samples e.g. from biopsied tissues are compared with the 
standard values. Deviation between standard and subject values 
establishes the parameters for diagnosing disease. 

In another embodiment of the invention, the polynucleotides specific for 
the proteins of the invention and homologous proteins may be used for 
diagnostic purposes. The polynucleotides, which may be used, include 
oligonucleotide sequences, antisense RNA and DNA molecules, and PNAs. 
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The polyn.cleo,id,s may be used to detect and quan«tate gene expression 
.n biopsied tissues in which gene expression may be correiated with 
d.sease. The diagnostic assay may be used to distinguish between 
absence, presence, and axcess gene expression, and to monitor regulation 
of protein levels during therapeutic intervention. 

In one aspect, hybridization with probes which are capable of d«e«lno 
polynucleotide sequences, including genomic sequences, encoding th! 
proteins of the invention and homologous proteins or closely related 
rnolecules, may be used to identify nucleic acid sequences which encode 
the respective protein. The hybridization probes of the sub,ec, invention 
may be DNA or RNA and are preferably derived from ti,e nucleotide 
sequence of ti,e polynucleotide encoding ti» proteins of ti,. invention or 
from a genomic sequence Including promoter, enhancer elements, and 
introns of the naturally occurrtng gene. Hybridization probe, maybe labeled 
by a variety of reporter groups, for exemple, radionuclides such as '^P or 
S or enzymatic labels, such a, alkaline phosphatase coupled to tine probe 
via avidin/biotin coupling systems, and tiie like. 

Polynucleotide s«,uences specific for the proteins of «,e invention and 
homologous nucleic acids may be used for the diagnosis of conditions or 
d»eases, which are associated with the expression of «,e proteins 
Examples of such conditions or diseases include, but are not limited to' 
metabolic diseases and disorders, including obesity, diabetes, and/or 
-netabolic syndrome. Polynucleotide sequences specific fortine proteins of 
the invention and homologous proteins may also be used to monttor ti,e 
progress of patients receiving treatment for metabolic diseases and 
disorders, including obesity, diabetes, end/or metabolic syndrome. The 
polynucleotide sequences may be us«l qualitative or quantitative assays 
e.g. in Soutiiern or Nortiiern analysis, dot blot or o«,er membrane-based 
technologies; in PCR technologies; or In dip stick, pin, ELISA or chip assays 
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utilizing fluids or tissues from patient biopsies to detect altered gene 
expression. 



In a particular aspect, the nucleotide sequences specific for the proteins of 
the invention and homologous nucleic acids may be useful in assays that 
detect activation or induction of various metabolic diseases and disorders, 
including obesity, diabetes, and/or metabolic syndrome. The nucleotide 
sequences may be labeled by standard methods, and added to a fluid or 
tissue sample from a patient under conditions suitable for the formation of 
hybridization complexes. After a suitable incubation period, the sample is 
washed and the signal is quantitated and compared with a standard value. 
The presence of altered levels of nucleotide sequences encoding the 
proteins of the invention and homologous proteins in the sample indicates 
the presence of the associated disease. Such assays may also be used to 
evaluate the efficacy of a particular therapeutic treatment regimen in 
animal studies. In clinical trials or in monitoring the treatment of an 
individual patient. 



In order to provide a basis for the diagnosis of a disease associated with 
expression of the proteins of the invention and homologous proteins, a 
normal or standard profile for expression is established. This may be 
accomplished by combining body fluids or cell extracts taken from normal 
subjects, either animal or human, with a sequence or a fragment thereof, 
which is specific for the nucleic acids encoding the proteins of the 
invention and homologous nucleic acids, under conditions suitable for 
hybridization or amplification. Standard hybridization may be quantified by 
comparing the values obtained from normal subjects with those from an 
experiment where a known amount of a substantially purified 
polynucleotide is used. Standard values obtained from normal samples may 
be compared with values obtained from samples from patients who are 
symptomatic for disease. Deviation between standard and subject values 
Is used to establish the presence of disease. Once disease is established 



and a treatment protocol is initiated, hybridization assays may be repeated 
on a regular basis to evaluate whether the level of expression In the patient 
begins to approximate that, which is observed in the normal patient. The 
results obtained from successive assays may be used to show the efficacy 
of treatment over a period ranging from several days to months. 

With respect to metabolic diseases such as described above the presence 
of an unusual amount of transcript in biopsied tissue from an individual 
may indicate a predisposition for the development of the disease or may 
provide a means for detecting the disease prior to the appearance of actual 
clinical symptoms. A more definitive diagnosis of this type may allow 
health professionals to employ preventative measures or aggressive 
treatment earlier thereby preventing the development or further progression 
of the metabolic diseases and disorders. 

Additional diagnostic uses for oligonucleotides designed from the 
sequences encoding the proteins of the invention and homologous proteins 
may Involve the use of PGR. Such oligomers may be chemically 
synthesized, generated enzymaticaily or produced from a recombinant 
source. Oligomers will preferably consist of two nucleotide sequences, one 
with sense orientation (5prime.fwdarw.3prime) and another with antisense 
(3prime.rarw.5prime), employed under optimized conditions for 
identification of a specific gene or condition. The same two oligomers, 
nested sets of oligomers or even a degenerate pool of oligomers may be 
employed under less stringent conditions for detection and/or quantification 
of closely related DNA or RNA sequences. 

In another embodiment of the invention, the nucleic acid sequences may 
also be used to generate hybridization probes, which are useful for 
mapping the naturally occurring genomic sequence. The sequences may be 
mapped to a particular chromosome or to a specific region of the 
chromosome using well known techniques. Such techniques include FISH, 
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FACS or artificial chromosome constructions, such as yeast artificial 
chromosomes, bacterial artificial chromosomes, bacterial PI constructions 
or single chromosome cDNA libraries as reviewed in Price, C. M. (1993) 
Blood Rev. 7:127-134, and Trask, B. J. (1991) Trends Genet. 7:149-154. 
FISH <as described in Verma et al. (1 988) Human Chromosomes: A Manual 
of Basic Techniques, Pergamon Press, New York, N.Y.). The results may 
be correlated with other physical chromosome mapping techniques and 
genetic map data. Examples of genetic map data can be found in the 1 994 
Genome Issue of Science (265:1 981 f). Correlation between the location of 
the gene encoding the proteins of the invention on a physical chromosomal 
map and a specific disease or predisposition to a specific disease, may help 
to delimit the region of DNA associated with that genetic disease. 

The nucleotide sequences of the subject invention may be used to detect 
differences in gene sequences between normal, carrier or affected 
individuals. An analysis of polymorphisms, e.g. single nucleotide 
polymorphisms may be carried out. Further, in situ hybridization of 
chromosomal preparations and physical mapping techniques such as 
linkage analysis using established chromosomal markers may be used for 
extending genetic maps. Often the placement of a gene on the 
chromosome of another mammalian species, such as mouse, may reveal 
associated markers even if the number or arm of a particular human 
chromosome is not known. New sequences can be assigned to 
chromosomal arms or parts thereof, by physical mapping. This provides 
valuable information to investigators searching for disease genes using 
positional cloning or other gene discovery techniques. Once the disease or 
syndrome has been crudely localized by genetic linkage to a particular 
genomic region, for example, AT to 1 1q22-23 (Gatti, R. A. et al. (1988) 
Nature 336:577-580), any sequences mapping to that area may represent 
associated or regulatory genes for further investigation. The nucleotide 
sequences of the subject invention may also be used to detect differences 



in the chromosomal location due to translocation, inversion, etc. 
normal, carrier or affected individuals. 



In another embodiment of the invention, the proteins of the invention, their 
catalytic or immunogenic fragments or oligopeptides thereof, an in vitro 
model, a genetically altered ceil or animal, can be used for screening 
libraries of compounds in any of a variety of drug screening techniques. 
One can identify effectors, e.g. receptors, enzymes, proteins, ligands, or 
substrates that bind to. modulate or mimic the action of one or more of the 
proteins of the invention. The protein or fragment thereof employed in such 
screening may be free in solution, affixed to a solid support, borne on a cell 
surface, or located Intracellulary. The formation of binding complexes, 
between the proteins of the invention and the agent tested, may be 
measured. Agents could also, either directly or indirectly, influence the 
activity of the proteins of the invention. 

In addition activity of the proteins of the invention against their 
physiological substrate(s) or derivatives thereof could be measured in 
cell-based assays. Agents may also interfere with posttranslational 
modifications of the protein, such as phosphorylation and 
dephosphorylation, farnesylation, palmitoylation, acetylation, alkyJation, 
ubiqultination, proteolytic processing, subcellular localization and 
degradation. Moreover, agents could influence the dimerization or 
oligomerization of the^ proteins of the invention or, in a heterologous 
manner, of the proteins of the invention with other proteins, for example, 
but not exclusively, docking proteins, enzymes, receptors, or translation 
factors. Agents could also act on the physical interaction of the proteins of 
this invention with other proteins, which are required for protein function, 
for example, but not exclusively, their downstream signaling. 

Methods for determining protein-protein Interaction are well known in the 
art. For example binding of a fluorescently labeled peptide derived from the 
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interacting protein to the protein of the Invention, or vice versa, could be 
detected by a change in polarisation. In case that both binding partners, 
which can be either the full length proteins as well as one binding partner 
as the full length protein and the other just represented as a peptide are 
fiuorescently labeled, binding could be detected by fluorescence energy 
transfer (FRET) from one fluorophore to the other. In addition, a variety of 
commercially available assay principles suitable for detection of 
protein-protein Interaction are well known In the art, for example but not 
exclusively AlphaScreen (PerkinElmer) or Scintillation Proximity Assays 
(SPA) by Amersham. Alternatively, the interaction of the proteins of the 
invention with cellular proteins could be the basis for a cell-based screening 
assay, in which both proteins are fiuorescentiy labeled and interaction of 
both proteins is detected by analysing cotranslocation of both proteins with 
a cellular imaging reader, as has been developed for example, but not 
exclusively, by Cellomics or EvotecOAI. In ail cases the two or more 
binding partners can be different proteins with one being the protein of the 
invention, or in case of dimerization and/or oligomerization the protein of 
the invention itself. Proteins of the invention, for which one target 
mechanism of interest, but not the only one, would be such protein/protein 
interactions are PSAP, SFTPB, and/or FLJ40379. 

Of particular interest are screening assays for agents that have a low 
toxicity for mammalian cells. The term "agent" as used herein describes 
any molecule, e.g. protein or pharmaceutical, with the capability of altering 
or mimicking the physiological function of one or more of the proteins of 
the invention. Candidate agents encompass numerous chemical classes, 
though typically they are organic molecules, preferably small organic 
compounds having a molecular weight of more than 50 and less than 
about 2,500 Daltons. Candidate agents comprise functional groups 
necessary for structural interaction with proteins, particularly hydrogen 
bonding, and typically include at least an amine, carbonyl, hydroxy! or 
carboxyl group, preferably at least two of the functional chemical groups. 



The candidate agents often comprise carbocyclic or heterocyclic structures 
and/or aromatic or polyaromatic structures substituted with one or more of 
the above functional groups. 

Candidate agents are also found among biomolecules including peptides, 
saccharides, fatty acids, steroids, purines, pyrimidines, nucleic acids and 
derivatives, structural analogs or combinations thereof. Candidate agents 
are obtained from a wide variety of sources including libraries of synthetic 
or natural compounds. For example, numerous means are available for 
random and directed synthesis of a wide variety of organic compounds and 
biomolecules, including expression of randomized oligonucleotides and 
oligopeptides. Alternatively, libraries of natural compounds in the form of 
bacterial, fungal, plant and animal extracts are available or readily 
produced. Additionally, natural or synthetically produced libraries and 
compounds are readily modified through conventional chemical, physical 
and biochemical means, and may be used to produce combinatorial 
libraries. Known pharmacological agents may be subjected to directed or 
random chemical modifications, such as acylation, alkylation, esterification, 
amidiftcation, etc. to produce structural analogs. Where the screening 
assay is a binding assay, one or more of the molecules may be joined to a 
label, where the label can directly or indirectly provide a detectable signal. 

Another technique for drug screening, which may be used, provides for 
high throughput screening of compounds having suitable binding affinity to 
the protein of interest as described in published PCT application 
W084/03564. In this method, as applied to the proteins of the invention 
large numbers of different small test compounds, e.g. aptamers, peptides, 
low-molecular weight compounds etc., are provided or synthesized on a 
solid substrate, such as plastic pins or some other surface. The test 
compounds are reacted with the proteins or fragments thereof, and 
washed. Bound proteins are then detected by methods well known in the 
art. Purified proteins can also be coated directly onto plates for use in the 
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aforementioned drug screening techniques. Alternatively, non-neutralizing 
antibodies can be used to capture the peptide and immobilize it on a solid 
support. In another embodiment, one may use competitive drug screening 
assays in which neutralizing antibodies capable of binding the protein 
specifically compete with a test compound for binding the protein. In this 
manner, the antibodies can be used to detect the presence of any peptide, 
which shares one or more antigenic determinants with the protein. 

The nucleic acids encoding the proteins of the invention can be used to 
generate transgenic cell lines- and animals. These transgenic non-human 
animals are useful in the study of the function and regulation of the 
proteins of the invention in vivo. Transgenic animals, particularly 
mammalian transgenic animals, can serve as a model system for the 
investigation of many developmental and cellular processes common to 
humans. A variety of non-human models of metabolic disorders can be 
used to test modulators of the protein of the invention. Misexpression (for 
example, overexpression or lack of expression) of the protein of the 
invention, particular feeding conditions, and/or administration of 
biologically active compounts can create models of metablic disorders. 

In one embodiment of the invention, such assays use mouse models of 
insulin resistance and/or diabetes, such as mice carrying gene knockouts in 
the leptin pathway (for example, ob (leptin) or db (leptin receptor) mice). 
Such mice develop typical symptoms of diabetes , show hepatic lipid 
accumulation and frequently have increased plasma lipid levels (see 
Bruning et al, 1998, Mol. Cell. 2:449-569). Susceptible wild-type mice (for 
example C57Bi/6) show similiar symptoms if fed a high fat diet. In addition 
to testing the expression of the proteins of the invention in such mouse 
strains (see EXAMPLE 4), these mice could be used to test whether 
administration of a candidate modulator alters for example lipid 
accumulation in the liver, in plasma, or adipose tissues using standard 



assays well known In .he art, such as FPLC. co.orim«ric assays, biood 
glucose level tests, insulin tolerance tests and others. 

Transgenic animals may be mad. through homologous recombination in 
. embryon-c stem cells, where the normal locus o, the gene encoding the 
protein of the invention is mutated. Alternatively, a nucleic acid construct 
encod.„g the protein is injected into oocytes and is randomly integrated 
■nto the genome. One may also express the genes of the invention or 
vanants thereof in tissues where they are not nom»lly expressed or at 
abnormal times of development. Furthermore, variants of the genes of the 
.nvenfon like specific cons^ucta expressing anti-sense molecule! ^ 
express,on of dominant negative mutations, which will block or alter the 
expression o, the proteins o, the invention may be randomly integrated into 
the genome. A detectable marker, such as lac Z or luciferase may be 
mtroduced into «,e locus of the genes of the invention, where upregulation 

LTr'T °' ™" - an easily 

de«c^b,e Change k, phenotype. Vectors for stable integration include 
Plasm.ds. retroviruses and other animal viruses, yeast artificial 
^rcmoscmes OTACs,, and the like. ONA constructs for homoju 

w,th the des,red genefc modification, and will include regions of homotagy 
o tf,e target locus. Conveniently, markers for positive and negativl 
ae.ect,on are .nCuded. DNA constructs for random integration do not need 
c contain regions of hdmology to mediate recombination. DNA constmcts 
for random integration will consist of the nucleic aoids encoding the 
prote-ns of the .nvention, a regulatory element (promoter), an intron and a 
Poly-adeny,atk,n signal. Methods for generating cells having targeted gene 
modihcafons through homologous recombination are known in the field 
For embryonic stem (ES) cells, an ES cell line may be employed, or 
ambryon-c cells may be obtained freshly from a host. e.g. mouse, ra, 
9u»,ea p,g. etc. Such cells are grown on an appropriate fibroblast-feeder 
layer and are grown in the pr«,e.nce of leukemia inhibiting factor (UF) ES 
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or embryonic cells may be transfected and can then be used to produce 
transgenic animals. After transfection, the ES cells are plated onto a feeder 
layer in an appropriate medium. Cells containing the construct may be 
selected by employing a selection medium. After sufficient time for 
colonies to grow, they are picked and analyzed for the occurrence of 
homologous recombination. Colonies that are positive may then be used for 
embryo manipulation and morula aggregation. Briefly, morulae are obtained 
from 4 to 6 week old superovulated females, the Zona Pellucida is removed 
and the morulae are put into small depressions of a tissue culture dish. The 
ES cells are trypsinized, and the modified cells are placed into the 
depression closely to the morulae. On the following day the aggregates are 
transfered into the uterine horns of pseudopregnant females. Females are 
then allowed to go to term. Chimeric offsprings can be readily detected by 
a change in coat color and are subsequently screened for the transmission 
of the mutation into the next generation (F1 -generation). Offspring of the 
F1 -generation are screened for the presence of the modified gene and 
males and females having the modification are mated to produce 
homozygous progeny. If the gene alterations cause lethality at some point 
in development, tissues or organs can be maintained as allogenic or 
congenic grafts or transplants, or in vitro culture. The transgenic animals 
may be any non-human mammal, such as laboratory animal, domestic 
animals, etc., for example, mouse, rat, guinea pig, sheep, cow, pig, and 
others. The transgenic animals may be used In functional studies, drug 
screening, and other applications and are useful in the study of the 
function and regulation of the proteins of the invention in vivo. 

Finally, the invention also relates to a kit comprising at least one of 

(a) a nucleic acid molecule coding for a protein of the invention or a 
fragment thereof; 

(b) a protein of the invention or a fragment or an isoform thereof; 

(c) a vector comprising the nucleic acid of (a); 

(d) a host cell comprising the nucleic acid of (a) or the vector of |b); 



-40-.,. ■ • 

(e) a polypeptide encoded by the nucleic acid of (a); 

a fusion polypeptide encoded by the nucleic acid of (a); 
an antibody, an aptamer or another effector against the nucleic acid 
of (a) or the polypeptide of (b), (e) or (f) and 
an anti-sense oligonucleotide of the nucleic acid of (a). 



(f) 

(g) 

(h) 



The kit may be used for diagnostic or therapeutic purposes or for screening 
applications as described above. The kit may further contain user 
instructions. 
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Figure 1 shows the content of energy storage metabolites (ESM; 
triglyceride (TG) and glycogen) of Drosophila Saposin-related (GadFly 
Accession Number CG1 2070) mutants. Shown is the change of triglyceride 
content of HD-EP(3)36824 flies caused by integration of the P-vector into 
the annotated transcription unit ('HD-36824 (TG, JQocy, column 3 and 
'HD-36824 (TG, 900C)', column 6) in comparison to controls containing 
about 880 fly lines of the proprietary EP collection CHD-control (TG, 
70°C)'), column 1) and wild-type controls determined In 4 Independent 
assays (referred to as 'WT-control (TG, 70<»C)', column 2), and in 
comparison to controls containing about 2100 fly lines of the proprietary 
EP collection CHD-confrol (TG, 900C)'), column 4) and wild-type controls 
determined in more than 80 independent assays (referred to as 'WT-control 
(TG, 90OC)', column 5). Also shown Is the change of glycogen content of 
HD-EP(3)36824 flies caused by Integration of the P-vector the Into the 
annotated transcription unit ('HD-36824 (glycogen, 90«»C)', column 8) in 
comparison to an internal assay control including two wild-type strains and 
a one HD-line (referred to as 'assay control (glycogen, 90°C)' column 7). 
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Figure 2 shows the molecular organization of the mutated Saposin-related 
(Sap-r; GadFly Accession Number CG12070) gene locus. 

Figure 3 shows human saposin-related proteins: Prosaposin (PSAP), 
human surfactant, pulmonary-associated protein B (SFTPB), or human 
hypothetical protein FLJ40379 

Figure 3A shows the nucleic acid sequence of human PSAP (SEQ ID NO: 
1) 

Figure SB shows the amino acid sequence (one-letter code) of human PSAP 
(SEQ ID NO: 2) 

Figure 3C shows the nucleic acid sequence of human SFTPB (SEQ ID NO: 
3) 

Figure 3D shows the amino acid sequence (one-letter code) of human 
SFTPB (SEQ ID NO: 4) 

Figure 3E shows the nucleic acid sequence of human FLJ40379 (SEQ ID 
NO: 5) 

Figure 3F shows the amino acid sequence (one-letter code) of human 
FLJ40379 (SEQ ID NO: 6) 

Figure 3G shows the comparison (ClustalW (1.83) protein sequence 
alignment analysis) of human and Drosophila protein. Gaps in the alignment 
are represented as In the figure 'PSAP Hs' refers to human prosaposin, 
'FLJ40379 Hs' refers to human hypothetical protein FLJ40379, 'SFTPB 
Hs' refers to human surfactant, pulmonary-associated protein B, and 'Sap-r 
Dm' refers to Drosophila Saposin-related protein . 

Figure 4 shows the quantitative analysis of prosaposin (Psap) expression in 
mammalian tissues. The relative RNA-expresslon is shown on the Y-axis, in 
Figure 4A to 4C the tissues tested are given on the X-axis. WAT refers to 
white adipose tissue, BAT refers to brown adipose tissue. In Figure 4D, the 
X-axis represents the time axis. 'dO' refers to day 0 (start of the 
experiment), 'd2' - 'dIO' refers to day 2 - day 10 of adipocyte 
differentiation). 
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Figure 4A shows *e quantitative analysis of |.sap expression in mouse 
Wild-type tissues. 

Fioure 4B sl,ows the quantitative analysis of Psap expression in wild-type 
m,c. (wt-mioe). compared to genetically obese mice (ob/ob^nice) and to 
fasted mice Ifasted-mice). 

Figure 4C shows the quantitative analysis of Psap expression in mice fed 
witii a control diet compared to mice fed with a high fat diet 
Figure 40 shows the quantitative analysis of Psap expression in 
mammalian fibroblast ,3T3-L1, cells, during the differentiation from 
preadipocytes to mature adipocytes. 

Figure 5 shows 'the quantitative anemia of prosaposm expression in 
human abdominal adipocyte cells, during «,e differentiation from 
preadipocytes to mature edipocytes. 



The examples illustrate the invention: 

Example 1 : Measurement of energy storage metabolites (ESM) contents in 
Orosophfia 

Mutant flies are obtained from a fly mutation stock collection. TH> flies are 
grown under standard conditions known to those skilled in the art. In ti,e 
course of the experiment, additional feedings with bakers yeast 
^accharomyces cerevisiae) are provided for «,e EPHlne H0-EPt3)36824 
The average change of triglyceride and glycogen (herein referred to as 
energy storage metabolitea, ESM) content of DrosophHa comaining the 
6P-vectors as homozygous viable Integration was investigated in 
comparison to control files, respectively (see r,gure „. For determination 
Of ESM content, flies were incubated for 5 mln at 70«C or 80«C in an 
aqueous buffer using a waterbatt,. followed by ho, exuaction. After 
ano«,er 5 mln incubation at 70»C or 90»C and mild centrifugation. «,e 
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triglyceride content of the flies extract was determined using Sigma 
Triglyceride (INT 336-10 or -20) assay by measuring changes in the optical 
density according to the manufacturer's protocol, and the glycogen content 
of the flies extract was determined using Roche (Starch UV-method Cat- 
No, 0207748) assay by measuring changes in the optical density according 
to the manufacturer's protocol. As a reference the protein content of the 
same extract was measured using BIO-RAD DC Protein Assay according to 
the manufacturer's protocol. These experiments and assays were repeated 
several times. 

The average triglyceride level of 883 fly lines of the proprietary 
EP-collection determined at 70^C (referred to as 'HD-control (TG, 70^C)') 
is shown as 100% in the first column in Figure 1 . The average triglyceride 
level of Drosophila wild-type strain Oregon R flies determined in 4 
independent assays at 70°C (referred to as 'WT-control (TG, 70^C)') is 
shown as 1 1 6% in the second column in Figure 1 . The average triglyceride 
level {jjQ triglyceride///g protein) of 2108 fly lines of the proprietary 
EP-collection determined at 90°C (referred to as 'HD-control (TG, 90**C)') 
is shown as 100% in the fourth column in Figure 1. The average 
triglyceride level {/ig triglyceride///g protein) of Drosophila wild-type strain 
Oregon R flies determined in 84 independent assays at 90**C (referred to 
as 'WT-control (TG, 90«C)') is shown as 102% in the fifth column of 
Figure 1 . The average glycogen level Ifig glycogen/jt/g protein) of an internal 
assay control consisting of two different wild-type strains and an 
inconspicuous EP-line of the HD stock collection (referred to as 'assay 
control (glycogen, 90*^C)') is shown as 100% in the seventh column in 
Figure 1 . The average triglyceride level {jjg triglyceride///g protein) of all 
flies of the EP collection (referred to as 'EP-controI') is shown as 100% in 
the first column in Figure 5. Standard deviations of the measurements are 
shown as thin bars. 



HD-EP(3)36824 homozygous flies show constantly a lower triglyceride 
content than the controls (column 3 in Figure 1, 'HD-36824 (TG, TO^C)'; 
column 6 in Figure 1, 'HD-36824 (TG, 90°C)'). H0-EP(2)21554 
homozygous flies also show a lower glycogen content than the controls 
(column 8 in Figure 1. 'HD-36824 (glycogen, 90oC)'). Therefore, the loss 
of gene activity is responsible for changes in the metabolism of the energy 
storage metabolites. 

Example 2: identification of Drosophila genes responsible for changes in 
metabolite contents 

Genomic DNA sequences were isolated that are localized directly adjacent 
to the EP vector (herein HD-EP(3)36824) integration. Using the isolated 
genomic sequences from HD-EP(3>36824 homozygous flies public 
databases like Berkeley Drosophila Genome Project (GadFly) were 
screened. The chromosomal localization site of HD-EP(3)36824 vector 
integration is at gene locus 3R, 100A6-7 (Flybase and Gadfly); The 
homozygous viable integration site of the HD-EP(3)36824 vector into base 
pair 535 of the Sap-r transcript CG12070-RA and 61 base pairs Sprime of 
the Sap-r transcript CQ 1 2070-RB in antisense orientation was confirmed. 
Therefore, expression of the cDNAs ehcoded by Sap-r could be affected by 
integration of the vector of line HD-EP(3)36824 leading to a change in the 
amount of energy storage metabolites. Figure 2 shows the molecular 
organization of this gene locus. A black double arrow in middle of the 
Figure represents the genomic DNA sequence. The space between two 
ticks represents a stretch of 1 000 base pairs. The black triangle labeled 
'HD-EP36824' indicates the Integration site of the EP-vector. The 
transcripts of Sap-r (as predicted by the Berkeley Drosophila Genome 
Project) are shown as dark gray bars (exons) linked by dark gray lines 
(introns) in the lower half of the Figure labeled as 'Sap-r'. 
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Table 1 is summarizing the data of our molecular analysis of the Drosophila 
proteins identified in this invention as being involved in the regulation of 
the metabolism. 

Table 1 . Molecular analysis of Drosophila Saposin-related 



Analysis 


Result 


Protein domains 


Beta-Ig-H3/Fasciclm ^Flvbase^ 


InterPro analysis 


Saposin tvue B, Saoosin tvpe A. Surfactant protein B 


Locus 


3R. 100A6-7 rFlvbase. Gadflv release 3^ 


cDNA (sap-r) 


AI108030 (578 base pairs mRNA), AI109190 (63S base 
pairs mRNA"i 


genomic DNA 


AE003775 


RefSeq TSap-r^ 


NM 079858. MM 170529. NP 524597. NP 733408 


Drosophila mutations 
& mutants 


Homozygotes for deficiencies removing Sap-r are fertile 
and viable and show no obvious phenotvpe. (Plvbase^ 



Example 3: Identification of the human Saposin-related homologous 
proteins 

Saposin-related homologous proteins and nucleic acid molecules coding 
therefore are obtainable from insect or vertebrate species, e.g. mammals or 
birds, preferably from humans, mouse, or Drosophila. Sequences 
homologous to Drosophila Saposin-related were identified using the 
publicly available program BLASTP 2.2.3 of the non-redundant protein data 
base of the National Center for Biotechnology Information (NCBI) (see, 
Altschul et al., 1997, Nucleic Acids Res. 25:3389-3402). Table 2 shows 
preferred human homologs of the Drosophila Saposin-related gene. 
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The term -polynucleotide comprising rte nucleotide sequence as si,own in 
GenBank Accession number" relates to tt,e expressible gene of the 
nucleotide sequences deposited under the corresponding GenBank 
Accession number. The term "GenBank Accession number" relates to NCBI 

Table 2. Human proteins homologous to Drosophila Saposln-related 

proteins 

J. Human proteins homologous to Drosophila Sap-r 
lA. Human prosaposin (PSAP) 

NCBI human locus identification (ID): 5660, Hs PSAP, prosaposin (variant 
Gaucher disease and variant metachromatic leukodystrophy), 10q21.q22 
RefSeq: GenBank Accession Number: NM_002778 

IB. Human surfactant, pulmonary-associated protein B (SFTPB) 

NCBI human locus identification (ID): 6439; Hs SFTPB, surfactant 

pulmonary-associated protein B, 2p12-p1l.2 

Aliases: SP-B, PSP-B, SFTB3, SFTP3 

RefSeq: GenBank Accession Number: NM_000542 

IC. Human hypothetical protein FLJ40379 

Nucleotide: ENSEMBL Accession Number: ENSG000001 73005 

Protein: ENSEMBL Accession Number: ENSP00000308224, Internationa. 

Protein Index (IPI) Accession Number: IPI001 63920 
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Example 4: Identification of secreted factors expressed in pancreas 

A screen for secreted factors expressed in developing mouse pancreas was 
carried out according to methods known by those skilled in the art (see, for 
example Pera E.M. and De Robertis E.M., (2000) Mech Dev 96(2): 
183-195) with several modifications. 

Expression cDNA library: 

A mouse embryonic stage 9.5-15 pancreatic bud library was prepared in 
pCMVSPORT-6 vector using SUPERSCRIPT Plasmid System from 
Invitrogen (cat.# 18248) according to the manufacturer's instructions. The 
non-amplified library was electroporated into MaxEff DH10B ceils 
(Invitrogen). 

Secretion cloning 

Bacterial clones from agar plates were picked with sterile toothpicks and 
cultured in 96-deep-well microtiter plates in 1 ml of LB-ampicillin (see 
Sambrook et al., supra). Aliquots of 8 cultures were pooled, and plasmid 
DNAs isolated by BioRobot_9600 (QIAGEN) using QIAprep(r) Turbo 
BioRobot Kit (QIAGEN). Human 293 cells were cultured in 75 ml tissue 
culture flasks in DMEM and 10% fetal calf serum. At 90-99% confluence, 
the cells were splitted at 1:3 ratio and plated at poly-D-lysin (Sigma) 
coated 96-weH plates, 1 50 A/l/well. Next day the the cells were transfected 
with 100-500 ng plasmids using lipofectamine 2000 (Invitrogen), 0,5 
//l/well, in 100 //I/well Optimem (Invitrogen). After 6 h the medium was 
exchanged for fresh complete growth medium. 24 hours after transfection, 
the cells were washed twice with DMEM w/o Cysteine and Methionine 
(Invitrogen), supplemented with 1 % dialysed Bovine serum (Sigma) with 
50 mg/ml Heparin (Sigma) and glutamine. The cells were labeled In 50 
//l/well of the same medium and 0,75 //l/well (-835 Met-label (HARTMANN 
ANALYTIC Gmbh, #44138). 12 hours later, 10 jj\ aliquots of the 
supernatants were harvested in 96-well PCR plates and subjected to SDS 
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ge eectrophoresls in precast 4±20%. gradient polyaorylamide Criterion 
99ls (B,orad) under reducing conditions, using Criterion Dodeca Cell gel 
running cl,an,ber (Biorad). The gels were fixed in 10% acetic acid 25% 
isopropanol for 30 min. soaked ,5-30 min in AMPLIFY reagent 
(An»rshan.), dried and exposed to X-OMAT (AR, film (Kodak). Positive 
Clones were Wentified by sub-seiection. The 8 individual bacterial clones of 
each positive pool were regrown in 96-well-plates. DNA of individual 
clones was prepared and used for transfection as described, if one of the 
Cones yielded proteins of the san,e size as that of «,e original pool, a 
PO,*ve Clone was identified. Positive clones were partially sequenced from 
the 5 end (SEQLAB. Goettingen). Sequences (of about 500 nucIeo«des) 
were compared to public nucleotide and protein databases to reveal 
similarity to previously described proteins. 



Example 5: Expression of the polypeptides in mammalian tissues 

To analyse the expression of th. polypeptWes disclosed in this invention in 
mammellan tissues, several mouse streins (preferably mice strains 
C57BI/6J. C57BI/6 ob/ob and C57B1/KS db/db which are standard model 
systems In obesity and diabetes research, were purch;sed from Herien 
W,nke.mann ,33178 Borchen. Germany, and maintained under constant 
temperature (preferably 22oc,. 40 per cem humidity and a light / dark 
cycle of preferably 14/10 hours. The mice were fed a standard chow (for 
example from ssniff Spezialitaten GmbH, order number ssniff M-Z 
Vme-OOO,. For the fasting experiment ("fasted wild-type mice", 
w.id.^pe mice were starved for 48 h without food, bu, only water supplied 
ad !,b,tum (see. for example. Schnetzler et al. J Clin Invest 1993 
Jul;92,1,:272-80. Mizuno et al. Proc Natl Acad Scl USA 1996 Apr 
1 6;93(8,:3434-8,. Animals were sacrificed at an age of 6 to 8 weeks. The 
ammal tissues were isolated accordtog to standard procedures known to 
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those skilled in the art, snap frozen in liquid nitrogen and stored at -80^C 
until needed. 

For analyzing the role of the proteins disclosed in this invention In the in 
vitro differentiation of different mamnnalian cell culture cells for the 
conversion of pre-adipocytes to adipocytes, mamnnalian fibroblast (3T3-L1 ) 
cells (e.g.. Green & Kehinde, Cell 1: 113-116, 1974) were obtained from 
the American Tissue Culture Collection (ATCC, Hanassas, VA^ USA; 
ATCC- CL 173). 3T3-L1 cells were maintained as fibroblasts and 
differentiated into adipocytes.as described in the prior art (e.g., Qiu. et al., 
J, Biol. Chem. 276:11988-95, 2001; Slieker et aL, BBRC 251: 225-9, 
1998). In brief, cells were plated in DMEM/10% FCS (Invitrogen, 
Karlsruhe, Germany) at 50,000 cells/well in duplicates in 6-well plastic 
dishes and cultured in a humidified atmosphere of 5% COj at 37°C. At 
confluence (defined as day Or dO) cells were transferred to serum-free (SF) 
medium, containing DMEM/HamF12 (3:1; Invitrogen), fetuin (300 //g/ml; 
Sigma, Munich, Germany), transferrin (2|/g/ml; Sigma), pantothenate (17 
/jM; Sigma), biotin (1 jjM; Sigma), and EGF (0.8 nM; Hoffmann-La Roche, 
Basel, Switzerland). Differentiation was induced by adding dexamethasone 
(DEX; 1 fjM; Sigma), 3-methyl-isobutyl-1-methylxanthine (MIX; O.S mM; 
Sigma), and bovine insulin (5 //g/ml; Invitrogen). Four days after confluence 
(d4), ceils were kept in SF medium, containing bovine insulin (5 //g/ml) until 
differentiation was completed. At various time points of the differentiation 
procedure, beginning with day 0 (day of confluence) and day 2 (hormone 
addition; for example, dexamethasone and 3-isobutyl*1 -methylxanthine), up 
to 10 days of differentiation, suitable aliquots of cells were taken every 
two days. 

RNA was isolated from mouse tissues or cell culture cells using Trizol 
Reagent (for example, from Invitrogen, Karlsruhe, Germany) and further 
purified with the RNeasy Kit (for example, from Qiagen, Germany) in 
combination with an DNase-treatment according to the instructions of the 



manufacturers and as known to thosa skilled in the art. Total RNA was 
reverse transcribed (preferably using Superscript II RNaseH- Reverse 
Transcriptase, from Invitrogen. Karlsruhe, Germany) and subiected to 
Taqman analysis preferably using tt,e Taqman 2xPCR Master Mix (from 
Apphad Biosystems, W.iterstadt, Germany; the Mix contains according to 
tt» Manufacturer for example AmpliTaq Gold DNA Polymerase, AmpErase 
UNG, dIMTPs With dUTP, passive reference Rox and optimized buffer 
components, on a GeneAmp 5700 Sequence Detection System (from 
Applied Biosystems, Weiterstadt, Germany). 



Example 6. Analysis of the differential expression of transcripts of «,e 
proteins of the invention in human tissues 

RNA preparation from human primary adipose tissues was done as 
described in EXAMPLE 2. The hybridization and scanning was performed 
as described in the manufactures manual (see Affyoietrix Technical 
Manual, 2002, obtained from Affmetrix, Santa Clara, USA,. 

The exprasstan analysis (using Affymetrix GeneChips, of the psap gene 
using Primary human abdominal adipocycte differentiation clearty show, 
drfferentlal expression of human psap in adipocytes. Two independent 
experiments were done. Both experiments show that the psap transcripts 
are the most abundance at day O compared to day ,2 during 
differentiation. These data further confim, the mouse 3T3L-1 
differentiation data. 

Thus, the psap protein has to be significantly decreased In order for the 
preadipocyctes to differentiate into mature adipocycte. The psap protein 
m preadipocyctes has the potential to inhibit edipose differentiation. 
Therefore, the psap protein might play an essential role in the regulation of 
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human metabolism. In particular in the regulation of adipogenesis and thus 
^ it might be an essential role in obesity, diabetes, and/or metabolic 

syndrome. 
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A pharmaceutical composition comprising a nucieic acid molecule 
encoding Sapcsin-related or a homoiogue thereof or a polypeptide 
encoded then,by or a fragment or a variant of said nucleic acid 
molecule or said polypeptide or an effector of said nucleic acid 
molecule or said polypeptide, preferably together with 
pharmaoeutically acceptable carriers and diluents. 

The composition of claim 1, wherein the nucleic acid molecule is a 
vertebrate or insect Saposin-related nucleic acid. p.r«culary 
encoding the human homologs (such as human prosaposin (PSAP) 
human pulmonary surfactant-associated protein B (SFTPB) or 
human hypothetical protein FU40379). and/or a nucleic molecule 
wh,ch is complementary thereto or a fragment thereof or a variant 
thereof. 



The composition of claim 1 or 2, wherein said nucleic acid molecule 
IS selected from the group consisting of 

(a) a nucleic acid molecule encoding a polypeptide as shown in 
SEQ ID IMO: 2, 4, and/or 6, or an isoform, fragment or variant 
of the polypeptide as shown In SEQ ID NO: 2, 4, and/or 6 
and/or a nucleic acid molecule complementary thereto- 

(b) a nucleic acid molecule which comprises or is the nucleic acid 
molecule as shown In SEQ ID NO: 1, 3, and/or 5 and/or a 
nucleic acid molecule complementary thereto; 

(c) a nucleic acid molecule being degenerate with as a result of 
the genetic code to the nucleic acid sequences as defined in 
(a) or (b), 

(d) a nucleic acid molecule that hybridizes at SO^C in a solution 
containing 1 x SSC and 0. 1 o/o SDS to a nucleic acid molecule 
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as defined in claim 2 or as defined in (a) to (c) and/or a 
nucleic acid molecule which is complementary thereto; 

(e) a nucleic acid molecule that encodes a polypeptide which is 
at least 85%, preferably at least 90%, more preferably at 

s least 95%, more preferably at least 98% and up to 99,6% 

identical to the human PSAP, SFTPB, and/or FLJ40379, as 
defined in claim 2 or to a polypeptide as defined in (a); 

(f) a nucleic acid molecule that differs from the nucleic acid 
molecule of (a) to (e) by mutation and wherein said mutation 

10 causes an alteration, deletion, duplication or premature stop 

in the encoded polypeptide. 

4. The composition of any one of claims 1*3, wherein the nucleic acid 
molecule is a DNA molecule, particularly a cDNA or a genomic DNA. 

15 

5. The composition of any one of claims 1-4, wherein said nucleic acid 
encodes a polypeptide contributing to regulating the energy 
homeostasis and/or the metabolism of triglycerides. 

20 6. The composition of any one of claims 1 -5, wherein said nucleic acid 
molecule is a recombinant nucleic acid molecule. 

7. The composition of any one of claims 1-6, wherein the nucleic acid 
molecule is a vector, particularly an expression vector. 

25 

8. The composition of any one of claims 1-5, wherein the polypeptide 
is a recombinant polypeptide. 



9. The composition of claim 8, wherein said recombinant polypeptide is 
30 a fusion polypeptide. 



The composition of any one of claims 1 -7, wherein said nijcleic acid 
molecule is selected from hybridization probes, primers and 
anti-sense oligonucleotides. 

The composition of any one of claims 1-10 which is a diagnostic 
composition. 

The composition of any one of claims 1-10 which is a therapeutic 
composition. 

The composition of any one of claims 1-12 for the manufacture of 
an agent for detecting and/or verifying, for the treatment, alleviation 
and/or prevention of metabolic diseases or dysfunctions, for 
example, but not limited to, metabolic syndrome, obesity, diabetes 
mellltus, eating disorder, cachexia, hypertension, coronary heart 
disease, hypercholesterolemia (dyslipidemia), and/or gallstones, and 
others, in cells, cell masses, organs and/or subjects. 

Use of a nucleic acid molecule of the PSAP, SFTPB, and/or 
FU40379 gene family or a polypeptide encoded thereby or a 
fragment or a variant of said nucleic acid molecule or said 
polypeptide or an effector of said nucleic or polypeptide for 
controlling the function of a gene and/or a gene product which is 
influenced and/br modified by a Saposin-related homologous 
polypeptide. 

Use of the nucleic acid molecule of the PSAP, SFTPB, and/or 
FLJ40379 gene family or use of a nucleic acid molecule encoding 
Saposin-related or a homologue thereof or use of a polypeptide 
encoded thereby, or use of a fragment or a variant of said nucleic 
acid molecule or said polypeptide, or use of an effector of said 
nucleic acid molecule or said polypeptide for identifying substances 
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capable of interacting with a Saposin-related homologous 
polypeptide. 
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15 



16. A non-human transgenic animal exhibiting a modified expression of 
a Saposin-related homologous polypeptide. 

17. The animal of claim 16, wherein the expression of the 
Saposin-related homologous polypeptide is increased and/or 
reduced. 

18. A recombinant host cell exhibiting a modified expression of a 
Saposin-retated homologous polypeptide, or a recombinant host cell 
which comprises a nucleic acid molecule as defined in any one of 
claims 1 to 6. 

19. The cell of claim 18 which is a human ceil. 



20. A method of identifying a (poly)peptide involved In the regulation of 
energy homeostasis and/or metabolism of triglycerides in a mammal 

20 comprising the steps of 

(a) contacting a collection of (poly)peptides with a 
Saposin-related homologous polypeptide or a fragment 
thereof under conditions that allow binding of said 
(poly)peptldes; 

25 (b) removing {poly)peptides which do not bind and 

(c) identifying (poly)peptides that bind to said Saposin-refated 
homologous polypeptide. 

21 . A method of screening for an agent which modulates the interaction 
30 of a Saposin-related homologous polypeptide with a binding 

target/agent, comprising the steps of 
(a) incubating a mixture comprising 
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20 



25 



30 



(b) 



(c) 



(aa) a Saposin-related homologous polypeptide or a 
fragment thereof;. 

(ab) a binding target/agent of said Saposin-related 
homologous polypeptide or fragment thereof; and 

(ac) a candidate agent 

under conditions whereby said Saposin-related homologous 
polypeptide or fragment thereof specifically binds to said 
binding target/agent at a reference affinity; 
detecting the binding affinity of said Saposin-related 
homologous polypeptide or fragment thereof to said binding 
target to determine a (candidate) agent-biased affinity; and 
determining a difference between (candidate) agent-biased 
affinity and reference affinity. 

A method for screening for en agont, which modulates the activity 
Of a Saposin-related homlogous polypeptide, comprising the steps . 

(a) incubating a mixture comprising 

(aa) a Saposin-related homologous polypeptide or a 
fragment thereof; and 

(ab) a candidate agent 

under conditions whereby said Saposin-related homologous 

polypeptide or fragment thereof exhibits a reference activity 
(b) detecting the activity of said Saposin-related homologous' 

polypeptide or fragment thereof to determine a (candidate) 

agent-biased activity; and 
(o) determining a difference betwe«, (candidate) agent-biased 
activity and reference activity. 

23. A method of producing a composition comprising the (polWpeptide 
.dentified by «,e m«hod of claim 20 or the agent identified by the 
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method of claim 21 or 22 with a pharmaceuticaliy acceptable 
carrier, diluent and/or adjuvant. 

The method of claim 23 wherein said composition is a 
pharmaceutical composition for preventing, alleviating or treating of 
diseases and disorders, including metabolic diseases or 
dysfunctions, for example, but not limited to, such as metabolic 
syndrome, obesity, diabetes mellitus, eating disorder, cachexia, 
hypertension, coronary heart disease, hyper-cholesterolemla 
(dyslipidemia), and / or-gallstones, and other diseases and disorders. 

Use of a (poly) peptide as identified by the method of claim 20 or of 
an agent as identified by the method of claim 21 or 22 for the 
preparation of a pharmaceutical composition for the treatment, 
alleviation and/or prevention of diseases and disorders, including 
metabolic diseases or dysfunctions, for example, but not limited to, 
metabolic syndrome, obesity, diabetes mellitus, eating disorder, 
cachexia, hypertension, coronary heart disease^ hyper- 
cholesterolemia (dyslipidemia), and / or gallstones, and other 
diseases and disorders. 

Use of a nucleic acid molecule as defined in any one of claims 1 to 
6 or 10, use of a polypeptide as defined in any one of claims 1 to 6, 
8 or 9, use of a vector as defined in claim 7, use of a host cell as 
defined in claim 1 8 or 1 9 for the preparation of a pharmaceutical 
composition for the treatment, alleviation and/or prevention of 
diseases and disorders, including metabolic diseases or 
dysfunctions, for example, but not limited to, metabolic syndrome, 
obesity, diabetes mellitus, eating disorder, cachexia, hypertension, 
coronary heart disease, hypercholesterolemia (dyslipidemia), and / or 
gallstones, and other diseases and disorders. 



Use of a nucleic acid molecule of the PSAP, SFTPB, and/or 
FLJ40379 gene family or of a fragment thereof for the preparation 
of a non-human animal which over- or under-expresses the PSAP, 
SFTPB, and/or FLJ40379 gene product. 

Kit comprising at least one of 

(a) a PSAP, SFTPB, and/or FLJ40379 nucleic acid molecule or a 
fragment or an isoform thereof; 

(b) a PSAP, SFTPB, .and/or FLJ40379 amino acid molecule or a 
fragment or an isoform thereof; 

(c) a vector comprising the nucleic acid of (a); 

(d) a host cell comprising the nucleic acid of (a) or the vector of 
(b); 

(e) a polypeptide encoded by the nucleic acid of (a), expressed 
by the vector of (c) or the host cell of (a); 

(f) a fusion polypeptide encoded by the nucleic acid of (a); 

(g) an antibody, an aptamer or another effector against the 
nucleic acid of (a) or the polypeptide of (b) , (e) , or (f) and /or 

(h) an anti-sense oligonucleotide of the nucleic acid of (a). 
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Abstract 



The present invention discloses Saposin-related homologous proteins 
regulating the energy honneostasis and the metabolism of triglycerides, and 
polynucleotides, which identify and encode the proteins disclosed in this 
invention. The invention also relates to the use of these sequences in the 
diagnosis, study, prevention, and treatment of metabolic diseases and 
disorders. 
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Figure 3. Human Prosaposin (PSAP); surfactant, human piilmonaiy-associated protein B 
(SFTPB); and human hypothetical protein FLJ 40379 

Figure 3A. Human prosaposin (variant Gaucher disease and variant metachromatic 
leukocfystrophy, PSAP), Nucleic acid sequence (SEQ ID NO:l) 

si tltlllftf.t gcggagtcag acggtgctat gtacgccctc ttcctcctgg 

ill lltlalltll ^f^^r^srct ctagccggcc cggtccttgg actgaaagaa tgcaccag^ 
121 gctcggcagt gtggtgccag aatgtgaaga cggcgtcoga ctgcggggca gtgaagcact 
241 r.llrr^^^'" cgtttggaac aagccaacag tgaaatccct tc?c5S|ac SSglaS 
Iti tZf f^"^'' cgcagctggt gatatgctga aggacaatgc cactglggag gaga?cc?t? 
301 tttacttgga gaagacctgt gactggcttc cgaaaccgaa catgtcJSc? ??f?;caa™ 
361 agatagtgga ctcctacctc cctgtcatcc tggacatcat taalggagaa atgagc^J?? 
Ill tltl^tlt Stgctctgct ctcaacctct gcgagtctct ccagSgLc ctagcagagc 
til l^mttt saagcagctg gagtccaata agatcccaga gctggaLtg actga^tOT 
541 tggccccctt catggccaac atccctctcc tcctctaccc tcaggacggc ccccacaa^ 
601 agccccagcc aaaggataat ggggacgttt gccaggactg catlfagSg gtga??Sca 
5" iTalatalll ^.ItT^'''' aactccacct ttgtccaggc cttggtggaf SfJSfagS 
III J??c?«2!t ll^t t^^^'' cctggcatgg ccgacatatg caagaactat atcigccag? 
Ill l^l^t^l ""1^ atgatgatgc acatgcaacc caaggagatc tgtglgctgg 

Itl c^™^!? ^^f'^^^Sf^ aaagagatgc ccatgcagac tctggtcccc gLLagtgg 
901 cctccaagaa tgtcatccct gccctggaac tggtggagcc cattaagaag cacgaaotce 
961 cagcaaagtc tgatgtttac tgtgaggtgt gtgaattcct ggtgaaggag gSIcSaoc 
1021 tgattgacaa caacaagact gagaaagaaa tactcgacgc tttCgacaaS 
llll ttTr^^t gtccctgtcg gaagagtgcc aggaggtgjt S"!;;?!; SafScfa 
im '^'^^'^f ^^^'^ cctgctggag gaggtcagcc ctgagctggt Itgcagcatg c?gcJcctc? 
llll flcactgaccg ttcacgtgac LlgcLaaJ JaSgSgct 

1261 tctgcgaagt gtgcaagaag ctggtgggtt atttggatcg caacctggaS laSacfgca 

ti^rf^ r.^iu^ iit^i^ s™ 

: ti^^t ?dTH ssii 

t^^- ^i^- - ~c s s 

ESS f ^T- ~- ~s ssss? uh^ 
S iFF ---^ ™- =°s?cS 

2im gcctctcagt tgaacaaagc agcaaaacaa aggcagttt? 

2101 atatgaaaga ttagaagcct ggaataatca ggctttttaa atgatgtaat tScactqJa 
222^ ?cStS^II f^'^lt^'^^^t gcagctgctg gtggcttggg acJtcJgtgJ ggccaagggt 
llll It ^^''Z tggttcaact gtgatttggc tttcccgtgt ctttcctggt gatgccttS 
2281 ttggggttct gtgggtttgg gtgggaagag ggccatctgc ctgaatgfJa ?cScta«t 
lltl Tt:lll^tlt f ggcttgtgtg agcgtgtgga cagtgg?ggc cgc?ctg?gS 

24?1 lltlltf^tt tgcctacatg tccctggctg ttgaggcgct gcttcagcct gLLcISc 
2461 cttgtctcat agatgctcct tttgaccttt tcaaataaat afcggatggcg agctcctaoa 

-ss?- s— 5 '^li^fii i'SE 

^ 'i^j^. ^^ssss ~ S2^3 



Figure 3B, Human prosaposin (sphingolipid activator protein- 1), amino acid sequence (SEQ ID 
NO: 2). 



1 ityalfllasl Igaalagpvl glkectrgsa 
61 Ipcdickdw taagdmlkdn ateeeilvyl 
121 ikgemsrpge vcsalnlces Iqkhlaelnh 
181 pqdgprskpq pkdngdvcqd ci<^vtdiqt 
241 cknyisgyse iaicpninmhinq pkeicalvgf 
301 pikkhevpak sdvycevcef Ivkevtklid 
361 vdtygssils illeevspel vcsmlhlcsg 
421 raleknstkq eilaalekgc sflpdpyqkq 
481 acpsahkpll gtekciwgps ywc<^tetaa 



vwcqnvktas dcgavkhclq tvwnkptvks 
ektcdwlpkp nmsasckeiv dsylpvildi 
qkqlesnkip eldmtewap fmanipllly 
avrtnstfvq alvehvkeec drlgpgmadi 
cdevkempmq tlvpakvask nvipalelve 
nnktekeild afdkmcsklp kslseecqev 
trlpaltvhv tqpkdggfce vckklvgyld 
cdqfvaeyep vlieilvevin dpsfvclkig 
qcnavehckr hwm 



Figure 3C Human surfactant, pulmonary-associated protein B (SFTPB), nucleic acid sequence 
(SEQ ID NO: 3) 

1 atggctgagt cacacctgct gcagtggctg ctgctgctgc tgcccacgct ctgtggccca 
61 ggcactgctg cctggaccac ctcatccttg gcctgtgccc agggccctga gttctggtgc 
121 caaagcctgg agcaagcatt gcagtgcaga gccctagggc attgcctaca ggaagtctgg 
181 ggacatgtgg gagccgatga cctatgccaa gagtgtgagg acatcgtcca catccttaac 
241 aagatggcca aggaggccat tttccaggac acgatgagga agttcctgga gcaggagtgc 
301 aacgtcctcc ccttgaagct gctcatgccc cagtgcaacc aagtgcttga cgactacttc 
361 cccctggtca tcgactactt ccagaaccag attgactcaa acggcatctg tatgcacctg 
421 ggcctgtgca aatcccggca gccagagcca gagcaggagc cagggatgtc agaccccctg 
481 cccaaacctc tgcgggaccc tctgccagac cctctgctgg acaagctcgt cctccctgtg 
541 ctgcccgggg ccctccaggc gaggcctggg cctcacacac aggatctctc cgagcagcaa 
601 ttccccattc ctctccccta ttgctggctc tgcagggctc tgatcaagcg gatccaagcc 
661 atgattccca agggtgcgct acgtgtggca gtggcccagg tgtgccgcgt ggtacctctg 
721 gtggcgggcg gcatctgcca gtgcctggct gagcgctact ccgtcatcct gctcgacacg 
781 ctgctgggcc gcatgctgcc ccagctggtc tgccgcctcg tcctccggtg ctccatggat 
841 gacagcgctg gcccaaggtc gccgacagga gaatggctgc cgcgagactc tgagtgccac 
901 ctctgcatgt ccgtgaccac ccaggccggg aacagcagcg agcaggccat accacaggca 
961 atgctccaro cctgtgttgg ctcctggctg gacagggaaa agtgcaagca atttgtggag 
1021 cagcacacgc cccagctgct gaccctggtg cccaggggct gggatgccca caccacctgc 
1081 caggccctcg gggtgtgtgg gaccatgtcc agccctctcc agtgtatcca cagccccgac 
1141 ctttga 



Figure 3D. Human pulmonary surfectant-associated protein B, 18kD, amino add 
(SEQ ID NO: 4) 

1 maeshllqwl llllptlcgp gtaawttssl acaqgpefwc qsleqalqcr alghclqevw 
61 ghvgaddlcq ecedivhiln kmakeaifqd tmrkfleqec nvlplkllinp qcnqvlddyf 
121 plvidyfcpiq xdsngicinhl glcksrqpep eqepgmsdpl pkplrdplpd plldklvlpv 
181 Ipgalqarpg phtqdlseqq fpiplpycwl cralikriqa mipkgalrva vaqvcrwpl 
241 vaggicqcla erysvilldt llgrmlpqlv crlvlrcsmd dsagprsptg ewlprdsech 
301 Icmsvttqag nsseqaipqa mlqacvgswl drekckqfve qhtpqlltlv prgwdahttc 
361 qalgvcgtms splqcihspd 1 



Figure 3E. Human hypothetical protein FLJ40379 (TENSEMBL Accession Number 
ENSG00000173005), nucleic acid sequence (SEQ ID NO: 5) 

1 ccatacatgc ttctggcagt gttgcacagc gttgcacagc ttggcggcct cctggctcct 
61 gcaccagaag cttgggceca gggcacactg gtcggtgccc agcagtgggg tcctggggcc 
121 gtggcaggcc cccaccttct tgcacacagc cacggggtcc atcatgtcct tgagaclStc 
181 aatgagcacg ggctcgtact gggtgacgaa gtgcttgcac tggatcatat agggcagcgg 
241 caggatgctg cagccaccct tgaaggccac caggatgtct cgcttggtgc tct?gctctl 
301 caagttgtgg gaggacaccg tgagcagcct cttgcaccca ttgcagaagc tgccctggtt 
361 ctccgcgtcc cactctgggg acggcacgat ggcataggca tcatggactg cccgggcccg 
m ff^^^^^gttg ccacacagac ggatgaactt gcacaccttc tctggggtga ttttggccac 
481 aagctgcacc aaggaggggc tgtaggtgtc caccaagatg atgcactcct tcgtgltaga 
541 ggcaggcatt accgagcaca cgcgctccag ggcatgggtg atcatgagct cagagctgtt 
^^S'^?^^^'^^ gcttctgcac cacgttcatg cacacctlac aggtLcIcc 
661 ggccttcatc tgcatctcgc tctgtttcct tggcaacccc agctccaggg agggaacccc 
ill ct^^o^ir^^ actacttgag tcaaacgggc aggtgcccct agctcctcll agS^?ccccc 
III ^^"^^^^^^^Sf agctcctgcg gggggagaag cctcagtgct tggtcagcag ggacaaaaaa 
Itl ltf,^^t^t^^ tagttcttgc agaggacggc caggccaggc cccaaggact cacactgctc 
901 ctggatgttc aagtcggcca aggtcaagtt ggaccggaca gcctcctgga gtcgggagac 
961 ctgccgtaca cagtcttggc acagagctcc ttcaggcgcc tggcggg^lt SgalStfag 
1021 gggcccattg gccatgaacg gagccacagc ctcaaaggtg tcctctttgg agagtggcc? 
1081 cagggtggcc aggtgcctct gcagcggctc acagaggctg agcgctgtgS aca?c??tgc 
llnl 'l^f^^^'^ll^ tccggggccc cacggagcat gctcaggatg gccgaactgt gggcatccac 
1201 catccacttg catccggctg aagactcctg gctggggagc cactcacajg tlStcatcJc 
1261 caaagccagg atgtcagact ccgtggcgtc agggttcagc ccattgcclj cggcggc^Jc 
llsl SaScaotff agggcagaga cttcgcggtg ggtttgttcS aJIcgJcS 

1381 ttggcagtac cccacagccc cgcacctggc agctgtctgc agatcctgac accacaccgt 
1441 ggagcccttt gcacactcct gggggcctga ggtggggctg gccctgg?gg cccccaggag 
1501 gctgggcagg aggagcaggg c. *.»-c»ggag 



^SSon^^'-jnSSr ^yP°*f<^ FU40379 (ENSEMBL Accession Number 

ENSP00000308224), ammo acid sequence (SEQ ID NO: 6) 

^2" ^^,^PSLLG ATRASPTSGP QECAKGSTVW CCDLQTAARC GAVGYCQGAV WNKPTAKSLP 
61 CDVCQDIAAA AGN6LNPDAT ESDILALVMK TCEWLPSQES SAGCKWM^A HSSAILsSr 
III S^^^^ CTALSLCEPI. QRHTATLRPL SKEOTFE^VA PfS^J SSSS 
181 CQDCVRQVSR LQEAVRSNLT LADUJIQEQC ESLGPGLAVL CKNYLFQFFV pSo^RLLP 
241 PQELCRKGGP CEELGAPARL TQWAMD6VP SLELGLPRKQ SmSgVT ^^S^JSk 
301 LDHWLMSNSS ELMITHALER VCSVMPASIT KECIILVotS SPSL^vS JSSS? 
361 RLCGNRRRAR AVHDAYAIVP SPEWDAEMQG SPCNGCKRLL tvss^^ sSSSJ 
421 KGGCSII,PI.P YMIQCKHFVT QYEPVLIBSL KDMMDPVAvS SSgS^^ J^SSI 
481 L6PSFWCRSQ EAAKLCNAVQ ITCQKHVW -"^^v^rw ifia^eTDQCA 



Figure 3G. Multiple Sequence Alignment (ClustalW 1 .83) 
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MYAI/FIiLASLI/SAAIiAGPVIiGIiKTCTRGSAVWCQNW 

ALLLLPSLLGATRASPTSGPQECAKGSTVWCQDIiQTAARCGAVGYCQ 

MAESHLIiQWLLLLliPTLCGPGTAAWTTSSIiACAQGPEFWCQSLBQALQCRAIiGHCI* 

MERAGLLAVLALCCA FGVPAAATPLLGSSKCTWGPSYWCGNFSNSKECRATRHCI 

QTVWNK- - - PTVKSLPCDICKDVVTAAGDMLKDNATEEEIIiVYLEKTCDWIiPKPK^ 

GAVWNK PTAKSLPOJVCQDIAAAAGNGIiNPDATESDIIALVMKTCEWLPSQESSAGC 

QEVWGH VGADDL-CQECEDIVHILNKMAKEAIFQDTMRKPLEQEam,PLKLU^ 

QTWETQKVPVDTDSICTICKDMVTQARDQI,^ 

KErVDSYLPVILrDI IKGEMS-RPGEVCSALNLCESIi 

KWMVDAHSSAIIiSMLRGAPDSAPAQVCTALSLCEPl. 

NQVLDDYFPLVIDYFQNQID — SNGICMHLGLCKSR 

IKVADDFLPBLVEALASQMN- -PDQVCSVAGLCNSARIDEIiYKNGlQAGLDGTVQNEDDS 

QKHIAEOmQKQLESNKIPELDMTEWAPFMANIPIiLLYPQDGPRSKPQ 

QRHLAT IiRPLSKEDTFEAVAPFMANGPLTFHPRQAP 

QPEPEQEP GMSDPLPKPLRDPLPDPLLDKIiVIiPVLPGALQ 

SEETELAMQPNQIiSCGNCNLLSRIiMHSKFAATDRDDMVETMIiHMCGSLS 



PKD- 



-NGD 
"EGA 



TYFNDIYDHVSKHLTTDAVCHVSGVCASRYHQHEEEKQP— QEALVALDAG DDI 

VCQDCIQMVTDIQTAVRTNSTFVQALVEHVKEECDRLGPGl^ICKKn^ISQYSEIAIQm 

LCQDCVRQVSRLQEAVRSNLTLAD LNIQEQCESLGPGLAVLCKNYLFQFFVPADQAL 

ARPGPHTQDIiSEQQFPIPLP 

PCEriCEQLVKHLRDVIArANTTETEFK--QVMEGFCKQSKGFKDECLSIW 



MHMQ PKEICALVGFCD- - 

RLLP PQELCRKGGPCEE- 



-EVKEMPMQTL 
-liGAPARLTQV 



VSKIiD--AN6ACCMIGICQKNSAS--SMKDVPIMPLLPVIEPAQVKlTIEKI,BKH^ 

VPAK-VASKNVIPALELVEPIK KHEVPAKSDVYCEVCEPLVKEVTKLIDNNKTEKE 

VAMD GVPSLELGLPRKQS EMQMKAGXH'CEVCMNVVQKLDHWLMSNSSELM 

YCWLCRALIKRIQAMIPKG A 

GASEPKFSQQEILDMQLPIDHLMGAANPGALVEGGEIiCTIiCEYMLHPIQETLATPSTO 

ILDAFDKMCSKLPKSLSEECQEWDTYGSSIluSILLEEVSPELVCSMLHLCSG 

ITHALERVCSVMPASITKECIILVDTYSPSLVQLVA-KITPEKVCKFIRLCGNRRR 

LAVAVAQVCRVVPLVAGGICQCLAERYSVILLDTLLGRMLPQLVCRLVIJlCSlfflJD 

IKHTVENICAKLPSGVAGQCRNFVEMYGDAVIALLVQGIJSrPRD^ 



TRLPALTVHVTQPKDGGFCEVCKKLVGYIiDRWI,EKN 

ARAVHDAYAIVPSPEWDAENQGSFCNGCKRLLTVSSHNLESK 

AGPRSP TGEWliPRDSECHLCMSVTTQAG N 

-KKEDVEVPNPQPAS DEQDPPTCPLCLPAVEQAQMKIRDN 



STKQEIIAAIiEKGCSFLPDPYQKQCDQPVAEYEPVLIEILVEVMDPSPVCLKIGACPSAH 
STKRDIIiVAPKGGCSILPLPYMIQCKHFVTQYEPVLlESLKDMMDPVAVCKK^ 

SSEQAIPQAMLQACVGSWLD-REKCKQPVEQHTPQLLTLVPRGWDAHTTCQALGVCGTMS 
KSKDNIKKVLNGLCSHLPiroiKEEC\mFVNTYSNELIDM^ 



PSAP Hs . KPI,IiGT EKCXW 

FIiJ40379 Hs TPLLGT DQCAL 

SFTPB Hs SPLQ ^jjj 

Sap-r Dm YALWDLRISIiEDDVDGEDK SSSEEISnjODIESIiEELPPQLAFDPGPTAAPNCLI 

PSAP Hs GPSYWCQOTETAAQCNAVEHCKRHVWN 

FLJ40379 Hs GPSFWCRSQEAAKLCNAVQHCQKHVW 

SFTPB Hs SPDL : . 

Sap-r Dm CEELVKTLEKRMGKHPTRDSIKHILEESCDRMRKPtO^ 

PSAP Hs . 

FLJ40379 Hs ZZZ 

SFTPB Hs "III 

Sap-r Dm PKLICTELGMCIIAD--l,DDLEVDEALKYDVn:ALPRQDNKLS SSIKEPPTCVLCE 

PSAP Hs 

FIiOr40379 Hs IIIIIIIZ 

SFTPB HS mill 

Sap-r Dm FIMTKI.DADr,KNKTEQDDIKRAIEAVasiIU^PATVRKQCD^ 

PSAP Hs 

PLJ40379 Hs ' IIIIIIIIII 

SFTPB Hs IIIIIIII 

Sap-r Dm VCQKLQLCPSVAVTD ESTLECGVCHGVTQALLPFLREKKDNVSEVTAI-QMrSVGCE 

PSAP Hs 

FIiJ40379 Hs I-IIIII 

SFTPB Hs III 

Sap-r Dm nlpakyykicsemisiygssiknlakrpyidqshicaeigkcfeseksslafarisa 
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FIGURE 5. Quantitative analysis of prosaposin expression in human abdominal 
adipocyte cells during the differentiation from preadipocytes to mature 

adipocytes 
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